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Editors’ Preface 


The papers have been divided, necessarily somewhat arbitrarily, into two parts 


Part A: The Scientific Papers 
Part B: Historical, Philosophical, and Socio-Political Papers 


Within each part, the papers have been divided by subject, and within each 
subject printed chronologically. With some exceptions, every scientific paper is 
reprinted in its original form. One class of exceptions consists of papers that 
are simply translations into Hungarian from German or English; they are omit- 
ted, but listed in the bibliographies. Scientific papers originally in Hungarian 
have been translated into English. Some of the papers of Volume V/ Part III, 
Articles, Reports, and Memoranda on Nuclear Energy, have been reset and the 
figures redrawn. The originals were declassified reports, some in nearly illegible 
shape. Some reports and patents in Volume V/ Part III and Part IV are listed 
by title only. In contrast to the scientific papers where the coverage is essentially 
complete, in Part B, a selection has been made. We believe it is representative 
of Wigner’s far ranging concerns. The five books in which Wigner was involved 
as author, co-author, or lecturer are not reprinted in The Collected Works, but 
are noted in the annotations and bibliographies. 

After the publication of Volume I, V, and VI of The Collected Works, 
Eugene Wigner died (1 January 1995). The heirs of his estate have donated his 
scientific papers and correspondence to Princeton University where they are 
available to researchers at 


The Eugene P. Wigner Papers 

Manuscripts Division 

Department of Rare Books and Special Collections 
Princeton University Libraries 


Springer-Verlag and the co-editors of The Collected Works intend to give to 
Princeton University for inclusion in this collection, whatever scientific papers 
are left over after the publication of The Collected Works is complete. 


Jagdish Mehra 
Arthur S. Wightman 
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PART I 


Physical Chemistry 


Wigner on Physical Chemistry 


Annotation by Nandor Balazs 


I. Studies in Chemical Reactions 


Though young Eugene Wigner’s original interests did lie in the direction of 
physics, he nevertheless started his university studies reading chemical engi- 
neering. He himself describes the reasons for this choice as follows [1]. “Father 
once asked me, “Tell me, son, what do you want to be?’ After thinking briefly 
I replied, ‘Daddy, if I may be quite honest, I would like to become a scientist, 
preferably a physicist.’ Father had expected this answer, and said, ‘Tell me, 
how many positions are there in this country for a physicist?’ I thought, and, 
exaggerating a little, answered, ‘Four’. Father, in turn, ‘And do you think that 
you are sufficiently clever to get one of the four?’ Thus, we then decided that 
if I go to the university I should become a chemical engineer.” After a year 
at the Technical University in Budapest (1920-1921) he continued his stud- 
ies at the Technical University, Berlin. Upon his arrival to Berlin his friendly 
and professional ties with Michael Polanyi increased. When, in 1923, Polanyi 
became a departmental head in the Kaiser Wilhelm Institute of Physical and 
Electrochemistry he also returned to his earlier interest in chemical reactions. 
Already before, during 1920 Polanyi came to the conclusion that the theories 
then in existence cannot be correct; and he even questioned whether the laws 
of physics known at that time were adequate to construct a correct theory of 
chemical reactions. He communicated these concerns to his close friend, Eugene 
Wigner, who had a very high opinion of Polanyi. It seemed thus natural that 
Wigner decided to write his doctoral dissertation on the problem of chemical 
reactions. | 

While working on his Thesis he also collaborated with others in widely dif- 
ferent fields. His first published paper (1923, with A.Szegvari [2]) deals with 
the concentration dependence of the dielectric constant of colloidal vanadium 
pentoxide, using a simple model to describe the polarizability of a colloid par- 
ticle. The next one, [3], with H. Mark, is a beautiful and detailed analysis in 
X ray crystallography, determining the crystal structure of a sulfur. This is an 
impressive achievement, since the unit cell contains 128 atoms! 

During the next year, 1925, he received his degree from the Technische 
Hochschule in Berlin. Wigner’s copies of the Thesis did not survive. One in 
Budapest was destroyed in 1945 during the turmoil of the Soviet occupation, 
the other copy in Berlin was burned in the bombing. 
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The times were auspicious to consider the theory of chemical reactions. 1925 
was the year Heisenberg, and Dirac constructed the foundation of quantum 
mechanics, and the following year brought Schrödinger’s celebrated series of 
papers on wave mechanics. And Polanyi was indeed correct: it required new 
laws of physics to achieve progress in the proper understanding of chemical 
reactions. 


Already Boltzmann noticed that the theory of chemical reactions falls natu- 
rally into distinct parts [4]. One is the description and analysis of the mechanism 
which leads to a reaction among the colliding constituents, or to the dissoci- 
ation of a constituent; the other part is the statistical mechanical description 
of the collection of particles which undergo the reaction. The reaction analysis 
consists of a) describing the elementary reactions in terms of the known quan- 
tal properties of the constituents; b) combining these elementary reactions in 
order to express in terms of them the actual reactions. These steps absolutely 
required the advent of quantum mechanics, since even the notion of valence 
forces were incomprehensible without it. However, most reactions envisaged 
took place in dilute gases and at normal temperatures; thus in the statistical 
mechanical description the usual paraphernalia of classical statistical mechan- 
ics could still be used, (these being in fact the semiclassical approximation of 
the more correct quantum mechanical results). 


On and off Wigner was concerned with the theory of chemical reactions 
for about fifteen years, from 1925 to 1939, writing eleven articles during this 
period. These are the following: [5], [6] with M. Polanyi; [7] with H. Pelzer; [8]; 
[9]; [10] with L. Farkas; [11] with H. Eyring; [12]; [13] with J.O. Hirschfelder; 
[14], [15]. One can rightly claim that these papers laid the foundation of the 
modern theory of chemical reactions as we know it. 


Paper [5], written together with Polanyi, is based in part on his Berlin 
thesis, and a brief account of it is also given by Wigner in Polanyi’s Obituary, 
written by him, ([16], p. 421). In it Wigner and Polanyi anticipated some of the 
most basic results of the yet to be invented quantum mechanics, to wit, that the 
energy spread Ae is related to the life time Ar through AeAr = h, and that the 
conservation of angular momentum is only valid in the formation of molecules 
for values of the angular momenta that are multiples ofh/2r. (Wigner once said 
jocularly ([1], p.7) “that he has been accused by several people inventing here 
Heisenberg’s uncertainty relations”. This seems to be also the result to which 
Wigner’s criptic remark refers in [17] that the time-energy uncertainty relation 
was known before Heisenberg’s.) Other parts of the paper are not yet quantal 
at all; among others, it studies monomolecular reactions by considering the 
probability of the internal energy fluctuation of a molecule exceeding a given 
threshold. This naturally leads immediately to Arrhenius factors in the decay 
rates. 

With the arrival of quantum theory further advances could be made. The 
basic step forward was provided by F. London [18] who clearly recognised that 
in a chemical reaction two different possibilities can arise. 1) The quantum 
states of the electrons do not change, but the chemical constitution is altered; 
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this situation is conventionally called the adiabatic case. 2) Both the quantum 
states of the electrons and the chemical constitution undergoes a change; the 
diabatic case. (In the discussion of the adiabatic case London makes full use of 
the so called Born-Oppenheimer approximation, before the Born-Oppenheimer 
paper!) The adiabatic situation is the usual one; in the hands of Wigner, and 
others it has led to the development of the now standard Transition State 
Theory, abbreviated often as TST. (The non-adiabatic theory was first studied 
also by F. London [19]. However, already in 1929 Wigner, with von Neumann, 
draws attention to the difficulties which can arise if adiabatic energy surfaces 
meet; [20].) The first appearance of TST is in [7]; it is then applied to reactions 
involving ortho- and para-hydrogen, and deuterium; [7], [8], [9], [10]. Paper [11] 
is a popular account of the TST, while paper [14] is an extensive survey of 
the developments up to 1937. It still can serve as a superb introduction to the 
theory. 


The basic idea is very simple. In a dilute gas we may consider only the 
atoms actually partaking in an elementary reaction to form the quantal system 
under consideration. In general, the nuclei of the atoms move slowly compared 
to the electrons. This allows one first to neglect the kinetic energy of the nuclei 
altogether, and compute the lowest energy state of this system, considering 
the nuclei as fixed. The adiabatic assumption now states that the nuclei will 
actually move as if this energy were the potential energy of the nuclear mo- 
tion. Since the nuclei are heavy, their motion can be described classically in 
the configuration space of the nuclei by the motion of a configuration point 
on this energy surface. This potential energy landscape is usually quite com- 
plicated. However, for many reactions, there will be a valley connecting the 
initial and final configurations, with a saddle point in the valley. The saddle 
point configuration is considered to be the transition state, and the value of the 
energy at the saddle point is the activation energy of the reaction. The transit 
of the configuration point across the saddle point describes the reaction, and it 
usually does so using classical dynamics. This picture may not be always valid. 
Firstly, the adiabatic hypothesis may fail. Secondly, it may be necessary to use 
the whole phase space (and not only the configuration space!) to describe the 
dynamical situations which correspond to the transition state, (as for exam- 
ple in association reactions in three body collisions). Finally, the presence of 
quantum effects may modify this picture. Wigner returned several times to the 
general discussion of these limitations. (See, for example the review [14], and 
the discussion after [15], including J. Franck’s remarks.) 


More specifically, in [13] he addresses himself to the limitations which arise 
using a classical passage over the quantal energy surface, and then to the use 
of some corrections to salvage the classical picture. There are two points con- 
sidered. Firstly, the classical definition of the transition state requires that the 
energy landscape be sufficiently flat across the saddle point; secondly, even then 
the classical passage across the saddle point must be corrected to take into ac- 
count the quantal reflections, and the tunnelling that may occur. In [13] Wigner 
estimates the condition on the flatness of the barrier, and the size of the quan- 
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tal reflections so as to allow this theory to be still applicable. The possibility 
of chemical reactions through tunnelling is already mentioned in [6, p. 441] this 
being the first mention in the literature of a “sub-barrier reaction”! In a re- 
markable note, [8], all the quantum corrections to the flux across the barrier are 
computed using the just invented Wigner function [21]. These corrections are 
then utilised in the description of ortho-to-para-hydrogen conversion. Ironically, 
the corrections make the agreement worse! 


There are, however, reactions for which the TST in its present form can- 
not be sensibly applied since the description of the reaction requires the whole 
phase-space of the atoms and not only their configuration space. Such a reac- 
tion is the association of two atoms in a three body collision. Wigner observed 
that this is more a problem of statistical mechanics, than that of quantum me- 
chanics, [15]. In the phase-space of the constituents of the three body collision 
a critical hypersurface can be defined which separates the bound states of the 
associated atoms from the free states, while the third body resides at infinity. 
The reaction rate constant is then expressible in terms of the average crossing 
rate across this phase-space surface in the required direction. 


These papers had considerable direct, and indirect effects on the scientific 
community. 


The direct effect was mainly on the chemical constituency, since Wigner’s 
work essentially mapped out the fruitful directions for any future progress. It 
provided a convenient structural division of the problem into statistical me- 
chanical and quantal parts; it assessed the approximations and shortcomings 
inherent in the division itself, and in the approximate calculations used in each 
part. Even today, very little needs to be changed in the excellent review [14], 
published in 1938, to provide an introduction to the modern theory of reaction 
rates. In addition to the general theory he also gave explicit calculations for 
actual experiments; see, for example his review article [10] (with L. Farkas) on 
the rates of elementary reactions in light and heavy hydrogen. The disparities 
between the actual and computed results were also put to constructive use to 
refine different models. (See, for example [15].) The basic insights produced in 
the papers have even been utilised in recent work on chaos, as in the study of 
fluxes through gaps in cantori [22]. The more recent developments in the the- 
ory of chemical reactions occur either in the statistical mechanical treatment, 
or in the quantal treatment. For the former see the review article [23]; for the 
quantal development see [25], [26], [27]. 


The indirect effect, particularly in nuclear physics was even greater. The 
late thirties saw the rise of nuclear physics and the study of fission. All the 
basic results of the theory of chemical reactions, as the use of adiabatic energy 
surfaces, the compound nucleus as a transition state, the possible tunnelling 
through the transition state, diabatic effects, ets., etc. could immediately be 
applied here. (See, for example the basic paper on nuclear fission, N. Bohr 
and J.A. Wheeler, [28], and a more recent survey [29].) In fact, the theory 
constructed in the 1925 paper [5], is also valid for nuclear association reactions. 
Similarly, nuclear fission resembles monomolecular chemical reactions. Thus the 
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theory of chemical reactions was a natural early training ground for Wigner 
when turning to nuclear physics; and was so used by many others as well. 


II. Studies in Statistical Mechanics 


While in quantum theory Wigner’s work encompassed both the foundations 
and the applications, in statistical mechanics he took a more pragmatic view. 
There are no publications on the foundations, as ergodic theory, approach to 
thermal equilibrium, origins of irreversibility, etc.; he used statistical mechanics 
as a given tool to obtain answers to actual problems in physics. (There is one 
important exception, the invention and use of Wigner’s Function. Because of 
its importance this will be discussed in the adjoining Section III.) 

His efforts fall naturally in two major parts, classical, and quantal. 

Classical statistical mechanics is used practically throughout in the theory 
of chemical reactions. As discussed there, he clearly distinguished between the 
quantal description of the atoms partaking in the elementary reaction, and the 
statistical mechanical description of the reaction-process itself. The latter was 
basically a classical one, corrected for quantum effects. For example, such a 
quantum notion as a chemical reaction is specified as either a passage of a 
point in configuration space across a saddle point [7], or across a hypersurface 
in phase-space [14]. Essentially all statistical mechanical concepts were given 
in terms of the canonical or microcanonical description. 

The full power of classical phase-space methods were exploited in [30] where 
a general discussion and derivation of the Onsager-reciprocal relations are given. 
Onsager’s original papers were hard to grasp. There, Onsager does essentially 
two things, 1) he normalises the thermodynamical fluxes and forces in a precise 
way; 2) next, he shows that if irreversible processes are described as linear re- 
lations between the thermodynamical fluxes and forces so normalised, then the 
coefficients appearing in these linear relations obey simple symmetry proper- 
ties, the Onsager relations. The proof makes use of two assumptions; a) that the 
decay of a fluctuation proceeds the same way as an irreversible process; b) that 
the microscopic motion of the underlying dynamical system is time reversible. 
In [30] Wigner explores the relation between the macroscopic description and 
the microscopic, phase-space description of a thermodynamical system, and 
shows that the principle of detailed balancing holds in the macroscopic de- 
scription as well if it holds for the microscopic one. Then he replaces the usual 
linear differential relations describing the irreversible process by linear integral 
relations, and shows that by virtue of the macroscopic detailed balancing the 
kernel of the integral equation is such that the Onsager relations immediately 
follow. 

{Let Q denote all the macroscopic coordinates needed in the macroscopic 
description. Then let P(Q) be the normalised probability density of the Ğ 
values, and T,(Q sa)! ) the transition probability from O to Q’ in time t. 
Then the linear equations of motion are written as f dQ’ T:(Q = )Q; = 
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>> e(t)¥ Qk. The Onsager relations hold if € is a symmetric matrix. Wigner 
now shows that as a consequence of macroscopic detailed balancing the operator 
P(Q); (Q — Q) P(Q’)-1/2 is a symmetric operator in a real Hilbert space 
of quadratically integrable functions, and this implies the Onsager relations. 
Since f dQ’T;(Q — Q’) = 1, this operator has as its eigenfunctions P(Q)" 2 
with eigenvalue unity, and the integral equations show that the submanifold 
spanned by Q; P(Q)/? is invariant under the action of this operator. } 

It is well known that the two body distribution functions of equilibrium 
statistical mechanics can be approximately computed from an integral equa- 
tion. In [31] Wigner and Ufford consider points on a circle interacting pairwise 
with the repulsive potential v(x) = —kT lnsin? rx/L, where x is the separa- 
tion between two points. They then compute both the exact pair distribution 
function and the solution of the integral equation. The solution of the integral 
equation differs from the exact solution. 

The literature of the integral equation method has since grown tremen- 
dously [32], and other exact results are also known [33]. 

Wigner’s work on the theory of random matrices (that was utilised here) 
has also other links with statistical mechanics. These will be discussed inde- 
pendently. 


IIT. Studies on Wigner’s Function 


Soon after the invention of quantum mechanics it became natural to study the 
quantum corrections arising in equilibrium statistical mechanics. Elementary 
arguments show that they will be more and more important as the tempera- 
ture of the system is lowered. Consequently the experimental efforts increased 
to measure at low temperatures the two standard experimentally accessible 
functions of thermodynamics, the specific heat of the system as a function of 
the temperature and pressure, and the pressure as a function of the density 
and temperature. 

The question then arises as to the quantum corrections to these two func- 
tions, or better, to the thermodynamical potential of the system. However, this 
question is only part of a more general, and more fundamental query. 


On what mathematical object, by what mathematical method can one display 
the existence of the classical limit; and what expansion scheme, in terms of 
what parameters will systematically provide the correction terms to the classical 
results ? 


For several reasons the answer is by no means obvious. 1) The most ef- 
ficacious display of classical dynamics is given in the phase-space, a manifold 
which has no natural equivalent in quantum theory, and thus would appear 
only in the classical limit, but not before, or during the limiting process. 2) The 
intuitively most natural, and mathematically most accessible quantum objects 
(as wave functions, matrix elements, transition probabilities etc.) usually refer 
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to particular representations, while the experimentally observed corrections are 
clearly representation independent. 3) While it is obvious that the deviation 
from the classical results will be more and more negligible as Planck’s con- 
stant tends to zero, the proper choice of the dimensionless small parameter, or 
parameters, is by no means clear. 

In order to answer these points a particularly attractive approach was 
conceived by Wigner, [21], for non-relativistic quantum theory in the ab- 
sence of spin. His subsequent papers on this topic are [8], [14], [34], [35], and 
[36] with R.F.O’Connell, [37] with R.F.O’Connell, [38], [39] with M. Hillary, 
R.F. O’Connell and M.O. Scully, [40] with R.F. O’Connell, [41] with Y.S. Kim, 
[42] with Y.S. Kim, [43] with Y.S. Kim ([38] demonstrates the immense popu- 
larity of the original memoir. It is from Citation Classic, and informs one that 
between 1961 (the year Citation Index started) and December 1983 over 655 
publications referred to this article.) 

(Identical suggestions to Wigner’s were put forward by Dirac in 1930 [44], 
and Heisenberg in 1931 [45]. These were ad hoc proposals without exploiting 
the full scope of the idea.) 

Wigner considers the density operator and associates with this operator a 
function of real parameters. (The association is invertible and one to one. For 
Hermitian operators the resulting function is real.) The parameters appearing 
in this function can be so chosen that in the classical limit they are identical 
to a particular set of classical canonical variables, i.e. to Cartesian rectangulars 
for the coordinates, with canonically conjugate momenta; also, in this limit the 
associated function becomes identical to the classical phase-space density in 
this particular set of variables. In the physics literature this function is called 
Wigner’s function, while in the mathematical literature it is named the Weyl 
symbol of the density operator. (See the later discussion on pseudodifferential 
operators). It turns out that even outside the classical limit these parameters 
may still be thought of as some mock coordinates and momenta, and Wigner’s 
function as some sort of mock phase-space density. This comes about since in 
the present scheme the computation of the correct quantal expectation values 
follows the same formal operations as the evaluation of standard, classical sta- 
tistical mechanical averages, for any value of fi, (thus not only in the classical 
limit). 

The association proposed can be expressed as 


Aw (p, q) ~ Trace {AAw (p, q)} ’ 


with its inverse 
A(Pop; Gop) S J dpdqAw (p, q) ^w (p,q) . 


Here p and q are the mock momentum and coordinate parameters, Aw (p, q) 
the function associated with the operator A. The singular operator Aw (p, q) 
mediates the association; (I do not indicate the dependence on the operators, 
but only on the parameters.) It is a sort of hybrid Dirac ö whose argument 
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is part operator, part c number, of the type (Pop — P, Gop — q), but suitably 
symmetrised in the operators. It is given by 


Aw(p,q) = | dudu expliu(pon - p) + iv(dap - Q/h. 


(The coeffficients in both formulae depend upon the normalisation used.) 
The expectation value of A in the state described by the density operator 
pis 
Trace Ap = I dp dq Aw (p, q)ew (p,q) - 


(Wigner’s original paper expresses his function differently. The convenient Aw 
operator was introduced by de Groot; see his Montreal lectures, or the review 
by Balazs and Jennings, both quoted later.) 

Thus, the exact quantal expectation value can be written as in classical 
statistical mechanics, since the right hand side of this expression is formally 
the same as used there. (The similarity is only formal, since pw (p, q), the ana- 
logue of the classical probability density, will in general not be positive as the 
classical probability density would be. In fact, there is an equivalence only be- 
tween expectation values and averages over associated functions using pw as 
the weight. Outside the classical limit the mock parameters p and q and their 
functions on the mock phase-space only “resemble” their classical equivalents.) 

Wigner did not exhibit this relation in its full generality. In particular, he 
did not discuss (because he did not need it) that all the functions appearing in 
the mock phase-space average must be generated by this association, and not 
only the function associated with the density operator. 

Later, in 1949, Moyal [46] pointed out that Wigner’s construction is iden- 
tical to the particular association proposed by H. Weyl in 1927 [47] between 
classical dynamical quantities and operators. Weyl used this association in re- 
verse, as a general prescription to construct operators associated with classical 
dynamical quantities. The so constructed operators correspond to a special 
ordering of the p and q operators, to the so called Weyl ordering. 

Wigner finds the asymptotic expansion of Wigner’s function by showing 
a) that if the density operator satisfies its quantal equation of motion, Wigner’s 
function satisfies an integral-differential equation; b) that in a formal expansion 
of the integral in a power series of A, the A independent term leads to Liouville’s 
equation, while the others contain formally the quantum corrections; c) that 
the solution of the full equation can be computed explicitly as a formal power 
series in A. The conditions on the nature of the convergence is left open. A 
necessary condition is that the forces of interaction should be slowly variable. 
Wigner also points out that the expression as it stands does not take into 
account the exclusion principle. 

Much of the later work, both by Wigner, and by others was in the following 
directions. 


a) General reviews 
b) Properties of the mock phase space density 
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c) Questions of uniqueness of the association 

d) Relativistic versions 

e) Versions applicable if a spin variable is also present 
f) Applications in physics 

g) Mathematical studies of the problem of associations. 


Wigner has contributed much to the study of these points as well. The first 
practical applications were in [8], where quantum corrections to chemical reac- 
tion rates are studied; and in [21], where the leading quantum corrections to the 
free energy are derived in general, (but without taking into account the effects 
of statistics.) The latter were then used to evaluate the quantum corrections 
for a low temperature He gas, and compared with the experimental results. 
These results were never published. They were too sensitive to the details of 
the, then, poorly determined interatomic forces, and were thus inconclusive. 
(Actually, the desire for an explanation of these particular experimental results 
gave Wigner the impetus to invent Wigner’s function.) 

A general Lorentz invariant theory of Wigner’s function is lacking. This 
is not surprising, since, a) there is no general, satisfactory, classical Lorentz 
invariant formalism of interacting particles either; b) the form of the association 
used by Wigner is deeply linked to Galilean transformations. For special cases, 
however, it is possible to extend the theory in a Lorentz invariant manner using 
light-cone, or null coordinates [40], [41]; this was applied to the description of 
light waves [40], [41]; and to the description of a collection of non-relativistic 
harmonic oscillators [42]. The latter, in turn, can be utilised to express in 
this manner, for a particular model the bound state of hadrons composed of 
quarks, and to calculate the nucleon form factor in some approximation [42]. 
(For Wigner’s general opinions on the problem of Lorentz invariance in this 
description, see his observations at the end of [35]. For other applications, see 
also the book by de Groot et al. on relativistic kinetic theory, quoted below). 

The incorporation of statistics and spin into this association is more del- 
icate. This is natural, since neither the spin variable, nor the Bose-, or the 
Fermi-statistics have any classical equivalent; thus, it is not to be expected 
that a formalism tailored to resemble the classical one should describe these 
modifications in simple terms. [40] contributes to this problem. Other work on 
these matters by different authors are recorded in the reviews listed below. 

There are several general review articles available on the work of Wigner 
and others. The most recent such reviews are by S.R. de Groot [48], S.R. 
de Groot, W.A. van Leeuwen and C.G. van Weert [49], P. Carruthers and 
F. Zachariasen [50], N.L. Balazs and B. Jennings [51], M. Hillary, R.F. O’Connell, 
M. Scully and E.P. Wigner [39]; F.J. Narcowich [52]. There is a considerable 
overlap among the general discussions given in the different reviews, although 
some of them have more special aims as well. Carruthers et al. concentrate on 
scattering problems; Balazs et al. on the variety of different possible associa- 
tions, and their relation to the transformation properties of the different mock 
phase-spaces engendered by them; de Groot et al. stress the applications in 
problems of relativistic kinetic theory. 
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Until the 1960’s the mathematical community took scant notice of these 
ideas, when they were independently rediscovered and used in the theory of 
ordinary and partial differential equations, and in the asymptotic analysis of 
their solutions. 


The theory of linear differential equations with constant coefficients is well 
understood. What happens, however, if the coeflicients are no longer constant, 
but, say, slowly variable? Formally, the equation with constant coefficients can 
be written as an operator function acting on a function where the operator 
contains Pop alone. If the coefficients depend on the variable x as well the 
operator can be now thought of as depending on qop and pop (usually in an 
antinormal ordering, pop acting first). Then, if the x dependence is weak, we 
can take into account this “weak” dependence, by considering pop and qop as 
“nearly” commuting, resembling thus a quantal description near the classical 
limit. Hence, it will be useful to recast the solution scheme in such a way that 
the “classical” solution should form a natural starting point. We are back, in 
the problem-setting of quantum theory, to the problem of the classical limit. 
Thus, the considerations which lead to Wigner’s function will become useful in 
this context as well. 


Wigner’s function, and other similar functions are now reintroduced for 
this purpose. In the mathematical literature they are designated as symbols 
associated with operators. It is easy to associate a function with an operator, 
since the matrix elements of the operator in a representation, the kernel of an 
integral operator, can be thought of as the associated function. These objects, 
however, have shortcomings. The integral kernels in the simplest q or p rep- 
resentation are not only highly singular, but in general it is not easy to see 
immediately their functional relation to the classical dynamical quantities to 
which they correspond. The introduction of these symbols remedies these de- 
fects by requiring them to have the following convenient properties. a) They 
should be singularity free; b) if the original operator is an operator function of 
only the pop, or only of qop, the associated symbol should be the same func- 
tion of the parameter p or of the parameter q as the operator function was of 
the corresponding operator; c) if the operator is a general operator function 
of Pop and qop, the symbol should have a well defined asymptotic expansion 
in h, with the leading term being the same function of the parameters p and 
q as the operator function was of pop and qop; thus the leading term should 
be the classical dynamical quantity; d) the, in general, singular integral kernel 
representing the operator in the q representation should be constructible from 
this, non-singular symbol and a well defined, universal singular contribution. 
Thus the symbol should have a classical significance, have an asymptotic ex- 
pansion, and should express the non-singular part of the kernel in a universal 
manner. The precise formulation of these conditions, the demonstration of the 
existence of such objects, and their actual construction is the task of the theory 
of pseudo-differential operators. 


Let gop, resp. Pop have the eigenvalues x’, resp. k’, with eigenfunctions 
|x’), resp. and |k’). Then the action of an operator A on a general ket |) 
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can be expressed in the q representation as an integral operator with the ker- 
nel (x”|Alx”). The simplest symbol of the operator A is given by o,4(q,p) = 
(x'|Al|k’) /(x'|k’), evaluated for x’ = q, k’ = p. The kernel is then expressed as 


(a! |Ale") = J dk'o a(z", k’) (xk) (Ra), 


which gives the standard transformation formula if the above expression for o A 
is resubstituted. This particular definition of the symbol gives a preferential role 
to the q representation, or to the anti-normal ordering of operators, while the 
adjoint symbol gives a preferential role to the p representation and to normal 
ordering. It has been extensively used by Kirkwood. (See the discussion of the 
Kirkwood association in the review [51].) 

An alternative symbol can be introduced which treats on equal footing all 
representations engendered by the eigenvectors of a general linear combination 
of Pop and qop, and which gives a preferential role to operators ordered according 
to Weyl’s prescription. These turn out to be the so called Weyl symbols, with 
Wigner’s function being the Weyl symbol of the density operator. The kernel 
can be simply expressed as 


(a! |Ala!) = J dpdg Aw (p, 4) («'|Aw(p, 4) |x”). 


(Thus we see that the Aw operator introduced to produce Wigner’s func- 
tion serves also to express the singular part of the kernel, while the Weyl symbol 
is used for the non-singular factor. A similar A operator can also be introduced 
to specify the singularity of the kernel when using the oA symbol. In general, 
the singular part of the kernel is always extracted by the introduction of a 
suitable singular A operator. The coefficient function is the associated symbol. 
This, then, insures that for each class of symbols the singular part of the kernel 
is extracted by the same singular operator.) 

There is a large literature on pseudo-differential operators. See, for example, 
the magisterial work of L. Hormander [53], with extensive references. (Where, 
however, Wigner is not even mentioned!), or the excellent short review by the 
same author [54]. 

Paper [55] with T. Teichmann deals with the scattering of an electromag- 
netic field, while paper [56] describes problems in the multiple scattering of 
particles. The two papers are independent of each other but both show a mas- 
terful understanding of the mathematical tools required. 

In [55] a perfectly conducting, simply connected cavity is considered with 
holes. The tangential components of the electric field is to vanish on the perfect 
conductor, but not necessarily in the holes. What is a complete set of character- 
istic fields in terms of which any cavity field can be expanded? In the absence of 
holes the complete set is given by the normal modes of the cavity, with the tan- 
gential components of the electric field vanishing on the whole boundary. Then, 
the associated normal frequencies are all different from zero. The presence of 
holes, however, allows zero frequency solutions, associated with possible static 
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and irrotational electric and magnetic fields. For standard boundary conditions 
only the static, irrotational magnetic field, or its potential is needed to complete 
the normal modes. This potential is a harmonic function which couples at the 
boundary with the normal modes. The paper then gives a constructive method 
to solve this complicated boundary value problem, and to find approximations 
to it. 

The study [56] applies the ideas and methods of group theory to problems 
in multiple scattering. Here, the prototype of an elementary act is a free flight, 
followed by a scattering, whose properties are given probabilistically. There are, 
however, many different possibilities. In general, an elementary act changes the 
state of the system. Let the elementary acts form a group, and suppose that, 
a) all states can be reached from a standard state through the operation of the 
group; b) the probability that an elementary act changes the state is invariant 
under the group. Then the evolution equation of the probability distribution 
function in terms of the transition probabilities, the Kolmogorov equation, 
can be written as an integral equation over the group. Two cases can be now 
distinguished. 1) the group has only one element which transforms the standard 
state into a particular given state; in this case the number of parameters which 
characterise the state is the same as the number of parameters of the group, 
and the different states of the particle can be labeled by the corresponding 
group element. 2) If the number of group parameters exceeds that of the state 
parameters there will be a subgroup whose elements do not change the standard 
state. Now the left cosets of this subgroup correspond to different states of the 
particle. Wigner shows that in both cases the same equation holds, but for case 
2 additional conditions must be imposed both on the distribution function and 
on the transition probability. 

The evolution equation reads as 


In+ı(t) = [as falo) PTS, 


where fn(s)ds is the probability that the state s, labeling group element s, 
(within the volume element ds in the invariant group space) has been reached in 
n steps, and P(s~'t)dt is the transition probability that in one step the element 
t within dt is reached starting from s. (P depends only on this combination due 
to the invariance of P under the group. For example, in the usual free flight 
problem this invariance asserts that the change of state depends only on the 
free flight connecting the two states, but not on the starting state as well; stt 
is the separation between s and t.) 

The great advantage of this abstract form - the equation appearing as 
a group integral - is the following. The group integral of the product of the 
convolute and the matrix of the representation is the product of two matrices. 
From this it immediately follows, multiplying the evolution equation by D(t), 
that 


J dt fn41(t)D(t) = J dsfn(s)D(s)IT, 
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where II = f dsP(s)D(s), and D(s) is a representation of s. This equation can 
then be trivially solved. For example, it corresponds to solving convolutions in 
terms of Fourier transforms, if the elementary act corresponds to a free flight 
in an arbitrary direction. There the transition probability was invariant under 
translations, and a Fourier transform appeared, the exponentials in the Fourier 
transformation being “really” the representation of the translation group. Here 
one sees that what is needed in general is the group representation of the group 
under which the transition probability is invariant. 

In fact the equation expressing fn+ı in terms of fn can be used as the 
prototype equation to define the convolution for the two functions fn and P, 
both defined over a group and one of them being invariant under the action 
of the group. It is by now clear that this idea can be exploited in different 
directions. It forms the basis of harmonic analysis over groups; it can be used to 
invent random walks and central limit theorems for random variables invariant 
with respect to groups of transformations. See for example the magnificent 
review by G.W. Mackey [57], or the book by A. Terras [58]. 


V. Studies in Conventional Quantum Theory 


This is an annotation to several papers dealing with some non-group theoretical 
aspects of quantum theory. These are [59], [20] with von Neumann, [60] with 
von Neumann, [61], [62], [63]. Save the last one, all belong to the early period 
of Wigner’s work. Three of them, [59], [61], and [62] were commissioned (es- 
sentially) by Rudolf Ortvay, who was (until the end of 1944) the head of the 
Institute of Physics at the Peter Pazmany Scientific University in Budapest, 
and the Editor of the Hungarian Journal of Mathematics and Physics. They 
were eventually presented by Ortvay in two sessions of the Hungarian Academy 
of Sciences. One can find some information on these matters in [64] which is 
a reprint of Wigner’s letters to Ortvay. In a letter dated May 20, 1932 there 
is a brief outline of [61], a paper which deals with the nature of “heavy” (or 
Chadwick’s) neutrons, and “light” (or Pauli’s) neutrons, the latter being now 
called neutrinos. (There are also further allusions to two other possible papers, 
conceivably to be published in the Hungarian journal. One of these would deal 
with time reversal, the other with the magnetic moment of stationary states. 
They were apparently not published.) On June 25 of the same year Wigner 
answers some observations of Ortvay on the neutron, neutrino problem, al- 
though in the mean time (June 13) [61] was read at the Hungarian Academy 
of Sciences. 

In the paper itself Wigner discusses the question whether the light neutrons 
are to be the same as the heavy neutrons, or the possibility that the two are 
different; the light neutrons’ mass being that of an electron, (“for symmetry 
reasons”, as Wigner says.) He further argues that possibly in -decay a light 
neutron and electron leave always together, and that a nucleus always contains 
an equal number of electrons and light neutrons. This would allow the correct 
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statistics for a nucleus, even if electrons are present in the nucleus, as was 
supposed at that time. He adds an interesting note in proof, arguing against 
Heisenberg’s supposition (1932) that a neutron is a bound state of an electron 
proton system. If this were so, then the lifetime of an H atom would be finite 
due to a transition from a 2p state of H to a neutron state. Such 2p states do 
appear if the H is in a chemical compound, and Wigner estimates the lifetime 
of such a compound to be of the order of magnitude of a month. 


A letter dated Oct. 14, 1934 is a covering letter to [62] which describes what 
is now known as the Wigner-Brillouin perturbation theory. A letter dated Dec. 3 
of the same year communicates to Ortvay Wigner’s realisation that Brillouin 
already published a formal series which corresponds to his result. However, as 
he points out, Brillouin obtains his results in a different way, and does not 
discuss the most important property of the perturbation theory, to wit, that 
it approximates the energy eigenvalues from above. In the published paper 
reference is made to Brillouin’s series. (In fact, both Brillouin and Wigner 
missed an earlier paper by Lennard-Jones [65] where the same perturbation 
theory has been proposed and utilized!) 


To summarise, I quote here from the Hungarian letter. “The present study 
aims to find an approximation scheme which approximates the ground state 
energy from above, similarly to a variational method where each approximation 
is a mean value of the operator H + V with respect to a normalised wave 
function. If we evaluate the expectation value of H + V with respect to a trial 
function {4a}, the problem amounts to minimising the expression {4b}. If in 
this expression we omit from the numerator the double sum, the minimum is 
reached when the {trial parameter} a has the value given in {5a}. Reinserting 
this value in the complete expression {4b} gives then the implicit equation 
{5b} for the determination of this value. This solution, which corresponds to 
the third approximation of the Rayleigh-Schrödinger equation, always gives a 
value too large. The higher approximations can now be obtained by replacing 
in the double sum, which is bilinear in a, one of the a factors with the other 
one as unknown.”... “The approximate expressions ... can be easily generalised 
for n = oo, giving a formal solution of the eigenvalue equation. In particular 
cases one can easily show that the infinite series converge to the true solution, 
and it is clear that the present method converges in many situations when the 
original method fails.” (The labeling of the equations via the {} braces are by 
me.) 


Paper |61] is a pedagogical paper showing that for composite systems Bose- 
Einstein, or Fermi-Dirac statistics is applicable depending whether the number 
of constituents are even or odd. It also contains an extremely up to date (1928) 
discussion on Heisenberg’s new (1928) observation that the absence of even, or 
odd lines in the bandspectra of molecules signal the symmetric or antisymmetric 
states of the nuclei; thus they can be used to specify the statistics of nuclei using 
optical observations. (There seems to be an accidental error in the conclusion 
of the Hungarian text (p. 582), where the words even and odd are interchanged. 
The German summary also contains an error; ‘*N is not a fermion.) 
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Papers [20] and [60], both written with v. Neumann and published back 
to back, raise two important problems in quantum theory. [20] deals with the 
problem of possible degeneracies in adiabatic perturbations. Let the Hamilto- 
nian of a system depend on a set of real parameters and consider in a parameter 
space a curve describing an orbit of the parameters. To each point on the or- 
bit a spectrum of the Hamiltonian (with the associated eigenfunctions) can be 
computed. What can one say about the behaviour of the spectrum? In partic- 
ular, under what conditions can the spectrum pass through degeneracies while 
the parameters advance along the path? If one considers the energy levels as 
a function of the parameters, the points where this does occur are called by 
M.V. Berry diabolical points, since the intersecting energy levels have the lo- 
cal appearance of a diabolo (and can be devilish to treat). The paper shows 
that this cannot happen in general unless the parameter space is at least three 
dimensional. (For real Hamiltonians two dimensions are already sufficient, al- 
though this is not explicitly discussed in the paper.) The utilisation of these 
results became essential in chemistry and in nuclear physics where the intersec- 
tion and non-intersection of energy levels in adiabatic perturbations are crucial 
for many phenomena. (See, for example the report by M.V. Berry [66], or the 
collection of papers in [67].) 


The notion of investigating the properties of the spectrum while the param- 
eters orbit in the parameter space turned out to be extremely useful. Remark- 
ably, the next step, the systematic study of the eigenfunctions as a function 
of the parameters was done only much later and led, in the hands of others, 
in particular M.V. Berry, to beautiful and deep results, as, for example to the 
notion of the geometric phase (when the orbits in the parameter space are 
closed). (See the collection [67]). 


Paper [60] studies two examples of eigenvalue problems with two particular 
potentials, which can give rise to discrete eigenvalues embedded in the contin- 
uum. The Hamiltonians studied describe three dimensional separable motions 
with a spherically symmetrical potential. The physics of each example is the fol- 
lowing. In the first case the existence of this anomalous eigenvalue corresponds 
classically to a motion which for the given energy reaches infinity during a finite 
time, rebounds from there and performs thus oscillations. (This is the conse- 
quence of a potential which for large r behaves as —r*.) In the other example 
the potential is oscillatory and falls of slowly at oo. Now the positive eigenvalue 
has an eigenfunction that is generated by “resonance” oscillations on the peaks 
and valleys of the potential. The oscillatory potential problem has been put in 
a more general mathematical context and discussed by B. Simon, who has also 
corrected a mistake in [60], and has displayed a class of oscillatory potentials 
for which there is one, and only one such positive eigenvalue. (See [68].) 


The behaviour of the Hamiltonian operator at infinity can give rise to diffi- 
culties in perturbation theory as well. Paper [63] considers the Coulomb poten- 
tial in the hydrogen atom Hamiltonian as a perturbation in a spherical box of 
radius R, using conventional Rayleigh-Schrödinger perturbation theory (with 
Dirichlet or Neumann boundary conditions at R), where the charge functions 
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as the perturbation parameter. The first order correction vanishes, and the sec- 
ond order correction leads, in the limit R — oo, to the correct form for the 
energy but with a numerical coefficient which is too small, irrespectively of the 
boundary condition selected. The remarkable feature here is the occurrence of 
the combination of a finite result with its being incorrect. The incorrectness is a 
consequence of a non uniform convergence in the parameters e? and 1/R; while 
the finiteness is due to the inverse first power dependence of the potential. 
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Über elektrische Erscheinungen bei Stäbchensolen 


A.Szegvari und E.P. Wigner 


Kolloid-Zeitschrift 33, 218-222 (1923) 


(Eingegangen am 23. Juli 1923.) 


1. Stäbchensole zeigen im Gegensatz zu 
anderen viele merkwürdige Eigenschaften ; so 
z. B. die elektrischen und magnetischen Rich- 
tungseffekte, und vielleicht als auffallendste Er- 
scheinung: die hohe induktive Leitfähigkeit, 
die zur Messung von ungewöhnlich hohen Di- 
elektrizitätskonstanten führt. Wie J. Errera!) 
zeigte, ist die „Dielektrizitätskonstante* eines 
Vanadinpentoxydsols sehr stark abhängig von 
seiner Konzentration, seinem Alter und so- 
gar von der Temperatur, ferner von der Spannung 
und Frequenz des bei der Messung angewen- 
deten Wechselstromes. Unter Umständen konnte 
er Dielektrizitätskonstanten messen, die fünf- 
mal (und noch mehr) größer waren als die des 
Wassers. 


1) J. Errera, Koll.-Zeitschr. 31, 59 (1922). 


Der zunächstliegende Gedanke zur Erklä- 
rung dieses Verhaltens könnte die Annahme 
sein, daß die Teilchen des Vanadinpentoxydsols, 
welche bekanntlich?) ausgeprägten Stäbchen- 
charakter besitzen, als elektrische Dipole auf- 
zufassen sind). Diese Betrachtungsweise ist 
aber kaum angängig. Dies geht erstens aus 
dem Verhalten der Vanadinpentoxydstäbchen im 
elektrischen Wechselfeld unter dem Ultramikro- 
skop hervor‘); es zeigte sich nämlich, daß die 
Teilchen im elektrischen Wechselfeld je nach der 
Feldrichtung sich hin und her bewegten ohne 
umzuklappen; wären es Dipole, so dürften 


2) Siehe u.a. Diesselhorst, Freundlich u. 
Leonhardt, Elster-Geitel-Festschrift 1915, 453. 


8) J. Errera, Koll.-Zeitichr. 32, 373 (1923). 
4) H.R. Kruyt, Koll.- Zeitschr. 19, 161 (1916). 
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sie sich, da es sich im allgemeinen um homo- 
gene Felder handelt, höchstens drehen. 
Und zweitens kann keine Ursache angegeben 
werden, die ein polares Verhalten der zwei 
Stäbchenenden bewirkte. 

2. Es möge hier bemerkt werden, daß die 
Teilchen einen tripolartigen Charakter be- 
sitzen. | 

Die Oberfläche eines Kolloides verhält sich 
ja wie eine elektrische Doppelschicht, und zwar 
sitzt die innere Belegung derselben ziemlich 
fest an der Teilchenoberfläche, während die 
äußere als Raumladung vorzustellen ist, die aus 
Ionen besteht, die durch die innere Belegung 
elektrostatisch angezogen sind. Demgemäß 
besitzt die äußere Doppelschicht gegenüber der 
inneren eine große Beweglichkeit. 


Nehmen wir nun an, die innere Belegung | 


entspräche einem an der ganzen Oberfläche 
des Teilchens konstanten Potentialunter- 
schiede gegenüber dem Medium. Da nun 
diese Teilchen aber nicht kugelförmig 
sind, kann hierbei die Oberflächendichte 
nicht konstant sein. Nehmen wir z.B. an, 
die Stäbchen wären angenähert Rotationsellip- 
soide, so müßte sich die Oberflächendichte der 
Spitze zu derjenigen der Seite verhalten wie 
die Stäbchenlänge zum Stäbchendurchmesser: 
sie wäre also an der Spitze viel größer. 
Da aber die elektrische Ladung hier an lonen 
gebunden ist und demnach einer größeren La- 
dungsdichte auch eine größere lonenkon- 
zentration entspricht, müßte bei der vor- 
herigen Annahme (konstantes Potential) auch 
die Ionenkonzentration der äußeren 
Ionenhülle an der Spitze größer sein 
als gegenüber den mittleren Partien. Nun 
trachten sich aber Konzentrationsunterschiede 
bei Ionen infolge des osmotischen Druckes 
auszugleichen; dies muß auch tatsächlich 
stattfinden. Dieser Konzentrationsausgleich dau- 
ert nun so lange, bis die hierdurch verursachten 
neuen Potentialunterschiede von 
Ende gegen Mitte den osmotischen Kräften 
das Gleichgewicht halten (Fig. 1). 


Fig. 1 


Es entsteht demnach ein Tripol (an den 
zwei Enden ein gleiches, aber von der Mitte 
abweichendes Potential); dies müßte man viel- 


an 


leicht bei der Erklärung einiger Erscheinungen 


der Stäbchensole mit in Betracht ziehen. (Auf 


das vorher erwähnte anomale induktive Ver- 
halten kann dies aber keinen besonderen Ein- 
fluß ausüben.) 

Die genaue Berechnung des Effektes ist 


nicht ausführbar, da die Ausgangsdifferential- 


gleichungen nicht zu integrieren sind. 

3. Der letzterwähnten induktiven Erschei- 
nung konnte man durch folgende Betrachtung 
gerecht werden. 


Wir denken uns der Einfachheit halber die 
äußere Belegung als eine das Teilchen auf eine 
gewisse Distanz umgebende Hülle. Wirkt nun 
ein elektrisches Feld auf dieses Gebilde ein, 
so wird es trachten, die Hülle und den 
Kern in entgegengesetzter Richtung 
zu verschieben (Fig. 2). Dieser Verschie- 
bung wirkt aber die elektrostatische Anziehung 
von Kern und Hülle entgegen. Während dieser 
Verschiebung muß also Arbeit gegenüber elek- 
trostatischen Kräften geleistet werden, diese 
wird auch vom angelegten Felde geleistet. 
Wenn sich nun die Feldveränderung umkehrt, 
wird diese entstandene potentielle Energie wie- 
der in Feldenergie umgewandelt. Dieses Ver- 
halten entspricht aber einer Kapazitätsver- 
größerung, also einer Vergrößerung der Di- 
elektrizitätskonstante des Gesamtsystems. 

Es möchte noch hinzugefügt werden, daß 
hierbei eine Richtkraft auftritt, die die Teil- 
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chen (sofern sie nicht von vornherein gerichtet 


waren) parallel zu den Kraftlinien zu richten sucht 


(Fig. 3), aber auf bereits gerichtete Teilchen 
überhaupt kein Drehmoment ausübt, was die 
eingangs erwähnten kataphoretischen Beobach- 
tungen erklärt. 


Die Berechnung des Effektes gestaltet sich 
am einfachsten wie folgt: 


Wir nehmen an, die vorher erwähnte Ein- 
stellung der Teilchen parallel zu den Kraftlinien 
wäre bereits vollständig. Weiter setzen wir 
voraus, daß die Feldveränderungen so langsam 
sind, daß die Wirkung der Hülle auf den Kern 
dem äußeren Feld ständig das Gleichgewicht 
hält, unbeeinflußt von der Viskosität des Mediums 
und der Trägheit der Teilchen. 


Bestimmen wir zunächst den Einfluß des 
elektrischen Feldes auf ein Teilchen von der 
Länge I und der Dicke 2r. Wir setzen des 
weiteren voraus, daß die durch die Scheitel- 
werte. der Feldstärke verursachte Verschiebung 
x relativ zur Gesamtlänge des Teilchens gering 
ist. Die Oberflachendichte sei o, die elektrische 
Feldstärke im Wasser E. Aus Fig. 4 ist er- 


sichtlich, daß man nur die Wirkung derjenigen 
Teile auf das elektrische Feld berücksichtigen 
muß, bei welchem infolge der relativen Ver- 
schiebung die Ladung freigelegt wird. 


EEX LE ia 


Die Wirkung des elektrischen Feldes auf sein 
freigelegtes Ende ist nun entgegengesetzt gleich 
der Wirkung der anderen Spitze auf das Ende: 


(2raxo)? _ 

"ED =E.2raxo (1) 
2rnxo ist die bei der relativen Verschiebung x 
„freigelegte“ Ladung; D bedeutet die Dielek- 
trizitätskonstante des Zwischenmediums (Wasser). 


Aus Gleichung (1) folgt für die „freie“ End- 
ladung l 
2rxxo = E!l D; 


multiplizieren wir diesen Wert mit 1, so be 

kommen wir das Moment m des Teilchens: 
m=1.2rnxso=EI®D (2) 

Führen wir jetzt das Verhältnis h von Länge 


] 
zu Dicke ein; es sei also h = re Ist ferner ọ 


die Dichte des Stäbchens, u seine Masse, so 

wird: . 
13 

u =r’ nle = 1270 

Führen wir diesen Wert (für 13) in Gleichung 

(2) ein: 

E Dh? te 

no 

In einem gewissen Gesamtvolumen v des Sols 

ist das von den Teilchen herrührende Gesamt- 

moment 


EDh?’u 

neo ` 
Bei dem rechts stehenden Ausdruck muß nur 
über x summiert werden; Zu ist aber gleich 
der im Volumen v gesamt gelösten Substanz. 
Beträgt ihre Konzentration c, so ist Sw=cv, 
demnach | 


Sm=323 


a EDI ae EDh2cv 
= or zsu= 

Nun ist bekanntlich die Dielektrizitätskon- 
stante K gleich 


wv 
ai 


(3) 


K=1+4Ar:, (4) 
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wo : die Elektrisierungskonstante bedeutet. Für | sich für ein einprozentiges Sol (c = 10%, 


ein Sol ist z. B.: 
Im +M 
Fe a 


=m ,. 2 M 
wo —— die von den Teilchen herrührende, — 
vE vE 
die von den in Betracht kommenden Hg O - Mole- 
külen herrührende Elektrisierung bedeutet (M 
ist die Momentensumme dieser Wassermoleküle). 
Zur Berechnung der letzteren setzen wir in 
Gleichung (5) *m = O (reines Wasser); wird 
der hierbei sich ergebende Wert von e in 
Gleichung (4) eingesetzt, so folgt, wenn man 
in Betracht zieht, daß jetzt K=D ist: 
M D—1 
vE 4x (6) 


DA 
Setzen wir nun in Gleichung (5) für = den 
v 


Wert aus Gleichung (3), für = den Wert aus 


Gleichung (6) ein, so wird: 
_ Dh’c D— 1 
= TỌ An ` 
Dieser Wert in Gleichung (4) eingesetzt ergibt 
endlich für die Dielektrizitätskonstante des Ge- 
samtsystems: 


K=D (14412). (7) 


Die Formel (7) besagt demnach, daß die für 
den Effekt ausschlaggebende Größe h, die „re- 
lative Länge“ der Teilchen, ist. Für annähernd 
kugelförmige Teilchen, wo h nahe gleich 1 ist, 
beträgt er höchstens einige Prozente der Di- 
elektrizitätskonstante des Mediums. Ferner ist 
die Dielektrizitätskonstante von der Ober- 
flächendichte, also vom Entladungszustande des 
Sols, ferner vom Dispersitätsgrad (in erster 
Annäherung) unabhängig. 


Die aus der Gleichung (7) berechneten Werte 
sind der Größenordnung nach richtig. Man 
kann nämlich ultramikroskopisch durch Aus- 
zählung feststellen, daß z. B. in einem Fall bei 
einem älteren Vanadinpentoxydsol die Teilchen, 
die länger als 15 u waren, einen großen Anteil 
des gesamten Soles ausmachten5). Da die Dicke 
der Vanadinpentoxydnädelchen. nicht auflösbar 
war (keine Trennung der Interferenzstreifen 
beider Ränder), kann für die obere Grenze 
der Dicke 0,5 u genommen werden; dieser 
Wert ergibt für h ungefähr 30, daher ergibt 


5) Die Sole sind nämlich polydispers, zum Teil 
sogar amikroskopisch, und es soll ja ein mittleres h 
geschätzt werden. 


ọ = 3,4) eine Dielektrizitätskonstante von 960. 


Nach der Formel (7) ist dieser Effekt bei 
jedem Stäbchensol vorhanden, wenn die ge- 
machten Voraussetzungen erfüllt sind, daß also 
unter anderem die äußere Belegung als eine 
zusammenhängende Raumionenladung betrachtet 
werden kann; bei perlschnurartigen Gebilden, 
wo die Stäbchen durch „gerichtete Koagulation“ 
entstanden sind, wird dies wohl nicht immer 
der Fall sein. Sind ferner die Schwingungen 
sehr rasch, so sind zwei weitere Vereinfachungen 
nicht mehr angängig, daß nämlich von der 
Trägheit der Teilchen und der Viskosität des 
Mediums abgesehen werden kann. Würde man 
auch diese in Betracht ziehen, so würde die 
Formel (7) eine Zeitgröße enthalten, also z. B. 
eine Funktion der Schwingungszahl des elek- 
trischen Feldes sein. 

Die Anwendung der Formel (7) gestattet 
auch Schlüsse in anderer Richtung zu ziehen. 
Es ist nämlich eine vielleicht noch offene Frage, 
ob die Teilchen des Vanadinpentoxydsols durch 
Kristallisation oder durch gerichtete Koagulation 
entstanden sind. J. Errera®) fand nun, daß 
die Dielektrizitätskonstante der Vanadinpent- 
oxydsole beim Altern stark zunimmt. Dies be- 
deutet also, daß hierbei h stark wachsen muß; 
denn von einer sonstigen Dispersitätsgradver- 
änderung ist ja die Dielektrizitätskonstante (in 
erster Annäherung) unabhängig. Ein Wachsen 
von h bedeutet aber, daß die Teilchen in der 
Länge im Verhältnis dauernd rascher, also 
„beschleunigt“, wachsen als in der Dicke. Bei 
einem Kristallisationsvorgange würde man an- 
nehmen, daß die Wachstumsgeschwindigkeiteiner 
jeden Kristallfläche normal zur Oberfläche kon- 
stant bleibt, dies würde also für das Ver- 
hältnis von Wachstumsgeschwindigkeiten in 
zwei verschiedenen Richtungen ebenfalls gelten. 
Der früher erwähnte Befund, nach dem sich 
also h beim Altern ändert, würde demnach so 
zu deuten sein, daß das Wachstum der Vana- 
dinpentoxydteilchen nicht durch einen ge- 
wöhnlichen Kristallisationsvorgang 
bewirkt wird. 


Endlich sei bemerkt, daß die Erscheinung 
eine typisch kolloidchemische bzw. kapillar- 
physikalische ist, da sie ja ihren Grund in der 
elektrischen Doppelschicht hat, also in der 
Heterogenität bzw. Zweiphasigkeit des 
Systems. Wie die Kataphorese, Elektroend- 
osmose usw. die elektrischen Gleichstrom- 


6) J. Errera, Koll.-Zeitschr. 31, 63 (1922). 
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erscheinungen der Kapillarphysik sind, ist 
das induktive Verhalten der Stäbchensole eine 
den Gleichstromerscheinungen dem Wesen nach 
vollständig analoge Wechselstromerschei- 
nung dieser Disziplin’). 


Zusammenfassung. 


1. Die Teilchen der Stäbchensole müssen 
sich tripolartig verhalten. 

2. Das von Errera beobachtete anomale 
induktive Verhalten der Stäbchensole läßt sich 


7) Eine weitere Bestätigung der Gleichung 7 ist 
die von Errera (Koll.-Zeitschr. 31, 62 [1922]) ge- 
messene Abhängigkeit der Dielektrizitätskonstante von 
der Konzentration des Sols. Er fand, daß (K— D) 
proportional zur Konzentration ist, ganz wie es Glei- 
chung 7 verlangt. 


qualitativ wie quantitativ durch die elektrische 
Doppelschicht begründen. | 


3. Die Dielektrizitätskonstante solcher Sy- 
steme ist, wenn man von anderen Einflüssen 
absieht, proportional mit dem Quadrat der ,re- 

Länge 


2 
lativen Länge (=) ) der Teilchen. 


4. Die Anwendung der sich ergebenden 
Formel auf das Verhalten von Vanadinpentoxyd- 
solen beim Altern läßt folgern, daß das An- 
wachsen der Vanadinpentoxydstäbchen nicht 
durch eine gewöhnliche Kristallisation erfolgt. 


‘Wir möchten nicht verfehlen, Herrn Pro- 
fessor Freundlich für viele wertvolle Rat- 


| schläge und Hinweise bestens zu danken. 


Aus dem Kaiser- Wilhelm- Institut für physikalische Chemie und Elektrochemie, 


Berlin- Dahlem. 


Die Gitterstruktur des rhombischen Schwefels 


H. Mark und E. P. Wigner 


Zeitschrift für physikalische Chemie 111, 398-414 (1924) 


(Eingegangen am 8. 5. 24.) 


I. Einleitung. 


Wenn man einen Blick auf die Tabelle der Elemente wirft, deren 
Struktur bisher bestimmt worden ist, so sieht man, dass hauptsächlich 
die hochsymmetrischen Strukturen bekannt sind, da zu ihrer Bestim- 
mung eine geringere Zahl experimenteller Feststellungen bereits hin- 
reicht, während die Bestimmung niedrigsymmetrischer Strukturen 
srössere Schwierigkeiten bereitet. Bei kubischen (tetragonalen, hexa- 
gonalen) Elementarkérpern hat man nämlich nur eine bzw. zwei 
Achsenlängen zu bestimmen, während z. B. im rhombischen System, 
deren drei festgelegt werden müssen. Darauf ist es wohl zurückzu- 
führen, dass bisher — abgesehen von Kohlenstoff und Silicium — nur 
Metallstrukturen gut erforscht sind. 

Nun sind aber eben die Elemente aus der 5., 6. und 7. Reihe des 
periodischen Systems, also hauptsächlich P, S, J solche, deren Struk- 
turen den Chemiker am meisten interessieren. Erstens weiss man hier 
aus rein chemischen Ursachen (Dampfdichte, osmotischer Druck in 
Lösungen und Verwandtes), dass die Atome zu Molekülen vereinigt 
sind und es ist sicher nicht ohne Interesse zu sehen, ob und wie dies 
in der räumlichen Anordnung der Beugungszentra zum Ausdruck 
kommt. Zweitens sind hier mehrere allotrope Modifikationen gefunden 
worden, so dass sich in diesen Fällen die Möglichkeit des Studiums 
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eines Umwandlungspunktes durch Vergleich der beiden Gitter ergibt!). 
Die genaue Untersuchung der beim Umwandlungspunkt obwaltenden 
Verhältnisse könnte nämlich neues Material zur Beurteilung der Bin- 
dung der Atome im Gitter an den Tag fördern. 

Unsere Wahl fiel von allen hier in Betracht kommenden Elementen 
deshalb auf den a-Schwefel, weil davon grosse und schön ausgebildete 
Kristalle vorlagen, die für die von uns verwendete Methode notwendig 
waren. Auch weiss man ziemlich sicher, dass der Schwefel in der 
8., also rhombisch-bipyramidalen, Kristallklasse kristallisiert?). Sir 
W. H. Bragg?) hat auch die Identitätsperioden des Schwefels schon 
im Jahre 1914 zu bestimmen versucht und von K. Weissenberg und 
dem einen von uns‘) waren bereits an kleinen Kriställchen eine Reihe 
von Drehdiagrammen aufgenommen worden, welche die Braggschen 
Achsenlängen bestätigten, der Annahme einer Basisflächenzentrierung 
dieses Elementarkörpers jedoch widersprachen. Da bereits in diesen 
Aufnahmen einige (allerdings sehr schwache) Punkte auftraten, die mit 
den gefundenen Achsen (a = 53 A, b = 6.4 A, c = 12.2 A) im Wider- 
spruch standen, schien es uns notwendig, an grossen, schön ausgebil- 
deten Kristallen die Untersuchung fortzusetzen. 


II. Das verwendete Material und die verwendete Methode. 


Für die Untersuchung standen uns drei schön ausgebildete Schwefel- 
kristalle zur Verfügung; zwei hiervon waren natürlich und stammten 
aus Girgenti’), während der dritte ein künstlich aus CS, gezüchtetes 
Exemplar war. | 

Ausserdem verfügten wir zur Herstellung von Drehdiagrammen 
über eine grosse Zahl gut ausgebildeter kleiner Kriställchen von 1 bis 
2 mm Grösse. 

An den grossen Einkristallen fanden sich die in der Tabelle 1 
enthaltenen Kanten und Ebenen wohlausgebildet. 


1) Vgl. die von Westgren am Eisen durchgeführte Untersuchung. Zeitschr. f. 
physik. Chemie 102 (1922). 

2) Herrn Prof. Dr. A. Johnsen verdanken wir den Hinweis, dass auch die rhom- 
bisch-bisphenoidische Klasse in Betracht zu ziehen sei. 

3) W. H. Bragg, Proc. Roy. Soc. London (A) 89, 575 (1914). 

4) Zeitschr. f, Elektrochemie (1923). 

5) Den einen verdanken wir der Güte des Herrn Prof. F. Becke in Wien, den 
anderen haben uns die Herren Prof. A. Johnsen und C. Belowski in liebenswirdigster 

Weise zur Verfügung gestellt. 
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Tabelle 1. 
Kanten Ebenen 
[010] (001) 
[110] (010) 
(101! (011) 
[011] (111) 
(113) 


Da es zur Vermessung des Elementarkörpers darauf ankam, alle 
drei Achsenlängen aus Drehdiagrammen direkt zu bestimmen und zur 
Kontrolle, sowie zur sicheren Feststellung des Bravais-Gitters auch 
die drei Flächendiagonalen und die Raumdiagonale direkt nach der 
Polanyischen Gleichung zu vermessen, mussten Schichtliniendiagramme 
um [100], [010], [001], [110], [101], [011], [111] angefertigt werden. Vier 
dieser Richtungen waren nach Tabelle 1 als Kanten ausgebildet, also 
leicht zu justieren, von den anderen dreien liess sich [001] ebenfalls 
sehr leicht einstellen, da die darauf senkrechte Ebene (001) gut aus- 
gebildet war. [100] haben wir dadurch eingestellt, dass die in (001) 
auf [010] senkrechte Richtung aufgesucht wurde; während die Ein- 
stellung von [111] etwas schwieriger zu bewerkstelligen war. 

Ausser der Bestimmung dieser Identitätsperioden wollten wir aber 
auch sämtliche wichtigen Ebenen nach der Braggschen Gleichung zur 
Reflexion bringen, um über den Intensitätsabfall der verschiedenen 
Ordnungen sowie über das Fehlen von Interferenzen dieser wichtigsten 
Netzebenen sicheren Aufschluss zu erhalten. Es wurden also sämt- 
liche in Tabelle 1 aufgeführten Ebenen direkt zur Reflexion gebracht; 
das als Wachstumsfläche niemals auftretende vordere Pinakoid (100) 
wurde durch Anschleifen freigelegt. 

In der Literatur finden sich über das Gitter des rhombischen 
Schwefels die beiden bereits zitierten Angaben, welche die Schwierig- 
keiten erkennen liessen, die sich einer sicheren Bestimmung in den 
Weg stellen würden. Es sind dies grosse Perioden als Kanten des 
Elementarkörpers, niedrige Symmetrie und das „schlechte Re- 
flexionsvermögen“ des Schwefelatoms. 

Die von Bragg angegebenen Perioden sind: a = 5.3, b = 6-4, 
c = 12-2 A, über das Reflexionsvermögen findet sich bei Bragg fol- 
gende Angabe!): „die vom Schwefel gelieferten Spektra sind alle 
schwach, besonders im Falle der (101) Ebene und in keinem Falle 
sind Spektra von höherer Ordnung als der ersten aufgefunden worden.“ 


1) W. H. Bragg, Proc. Roy, Soc. London (A) 89, 575 (1914). 
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Die Umgehung dieser Schwierigkeiten gelingt dadurch, dass man 
das schlechte Reflexionsvermögen durch ausgiebige Beleuch- 
tung (grosse Intensität des Primärstrahles, lange Beleuchtungsdauer) 
kompensiert. 

Grosse Kantenlängen eines Elementarkörpers bewirken bei ge- 
gebener Strahlung, dass die niedrig indizierten Interferenzen sich in 
der Nähe des Durchstosspunktes anhäufen und besonders bei niedriger 
Symmetrie sehr bald eine Vieldeutigkeit in der Zuordnung von re- 
flektierenden Ebenen zu den gemessenen Linien auftritt. Um dies zu 
verhindern, haben wir eine Kamera mit sehr grossem Durchmesser 
(d = 30 cm) gewählt und als monochromatisches Licht im Gegensatz 
zu Bragg, welcher Molybdän (A = 0-712) verwendete, Kupfer (2 = 1.54) 


S—> 


Fig. 1. 


gebraucht. Durch Anwendung der Schneidenmethode gelang es ferner- 
hin, scharfe Linien (vgl. etwa die Fig. 5 um [110]) zu erzielen, deren 
Vermessung grosse (Genauigkeit gewährte. Ausserdem wurde der 
Kristall immer nur um einen bestimmten Winkel und nicht um 360° 
gedreht, so dass von vornherein nur wenige, ganz bestimmte Ebenen 
zur Reflexion gelangten. Hierdurch wurde erreicht, dass in keinem 
Falle bezüglich der Indizierung eine Zweideutigkeit möglich 
war. Die verwendete Kamera, welche der Seemannschen nach- 
gebildet war, und die Rechenmethode seien im folgenden kurz beschrieben. 

Ein Kristall X (Fig. 1) wurde mit einer bestimmten Ebene, der 
„eingestellten Ebene“, gegen eine Platte W gelehnt und mit dem kon- 
vergenten Bündel B von monochromatischen Röntgenstrahlen be- 
leuchtet. Als Blende dient die Wolframplatte W, die gegen das Kristall 
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gerichtet ist. Der Film F war in der Entfernung von 5 = 15cm in 


einer Zylinderfläche von der Höhe von etwa 10 cm ausgespannt. Die 
Spitze der Blende W war in etwa !/,, mm Entfernung von der Ober- 
fläche des Kristalls, so dass die Linien etwa diese Breite hatten. 

Der Öffnungswinkel des Röntgenstrahlenbüschels sei «,, des Films Oo. 
Die reflektierende Netzebene schliesse mit der Richtung SF’ den Winkel e 


ein. Ist dann I der Glanzwinkel dieser Ebene, so erscheint das 


Bild ihrer Linie L unter dem Winkel 8, wo 
od 
p = e+ pI (1) 


ist. Dafür dass es überhaupt erscheine, ist (wenn die Linie am Äquator 
liegt) notwendig und hinreichend, dass 


o< eca (2) 


0< $ teL o (3) 


wie aus der Fig. 1 ersichtlich. Wenn (2) nicht zutreffen würde, würde 
die Ebene überhaupt nicht zur Reflexion gelangen; würde (3) nicht 
zutreffen, so käme das Bild nicht auf den Film. 

Bei unseren Aufnahmen war a, ~a,~ 45°. Für diesen Fall ist 
der für uns in Betracht kommende Bereich in Fig. 2 schraffiert ge- 
zeichnet. Wir ersehen daraus, dass die 
Ebene höchstens den Winkel von 22'/,° mit 
der „eingestellten Ebene“ einschliessen darf, 
der Glanzwinkel dem absoluten Wert nach 
grösser sein muss, als der Winkel e zwi- 
schen den beiden Ebenen und dass die 
Summe der Absolutwerte der beiden Winkel 


97 


> +€ kleiner sein muss als 45°. Diese 


Bedingungen erwirkten, dass die Linien un- 
Fig. 2. zweideutig indizierbar waren. 

Das „konvergente Strahlenbündel B“ 
wurde so erzeugt, dass wir die ganze in Fig. 1 abgebildete Vorrichtung 
um die Achse O um a, in der Richtung D durch ein Uhrwerk drehen 
liessen, während der Strahl in Wirklichkeit immer nur in der Rich- 
tung S (mit einem Offnungswinkel von etwa 5°) kam. 
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Wenn etwa (uvw) die „eingestellte Ebene“ war, und etwa [hk] 
die Drehrichtung darstellte, so wurden zunächst zur Indizierung des 


Äquators die Winkel &...... e; zwischen den Ebenen (rst), für welche 
hr- ks + lt = 0, und der Ebene (wvw) berechnet nach der Formel 
1 ur vs wt 
| cosers: = z—y— (fe t +S) (4) 
wobei 


ist. Diese Formeln gelten nur für einen rhombischen Elementarkörper 
mit den Achsen a, 5 und c. Dann wurden aus ihrer Gesamtheit die- 
jenigen ausgewählt, welche im Verein mit 3,,, die Bedingungen (2) 
und (3) erfüllten. 9,,; wurde hierbei nach der quadratischen Form 
) 2 2 2 2 
sin t= (Bt Rt a) m 
berechnet. Diese Methode hat also die Kenntnis der Achsen a, b und c 
zur Voraussetzung. Da die Kantenlängen des Elementarkörpers vorher 
in unabhängiger Weise durch die Polanyische Gleichung 
RL. (8) 
COS 4t 
(wobei J die Identitätsperiode, also in diesem Falle die Kantenlänge, 
à = 1.54 die Wellenlänge des Lichtes, 90°— u den Winkelabstand 
vom Aquator und n die Ordnungszahl der Schichtlinien bedeuten) be- 
stimmt worden war, war sie anwendbar. 
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Für die Indizierung der Schichtlinie (Fig. 3, dabei ist die Dreh- 
achse, mit OX gekennzeichnet, vertikal angenommen) wurden wiederum 
zunächst die Winkel e bestimmt, welche die auf der ersten Schicht- 
linie auftretenden Ebenen (rs, 4), für welche hv, + ks, + lt; = 1, mit 
(uvw) bilden. Die Winkel, welche die Normalen der Ebenen (r,s, 4) 
mit der Drehrichtung einschliessen, sind 


cos (90° — ce) = u nn (9) 
Jast Vh?a? -+ k2b2 + e? 
Der Winkel zwischen dem einfallenden Strahl und der Projektion B 
der Ebenennormalen in die Horizontale ist 


sin A 
2 
cosa = 


(10) 


COSC 

Die Winkel, welche der einfallende Strahl S mit den Projektionen 
der rellekkierien. Strahlen in die Horizontale einschliesst, sind: 

cosa’ = SY (11) 
sin u 

Die Winkel zwischen der Normale der ,,eingestellten Ebene“ und 
der Projektion der Normale der reflektierenden Ebene in die Hori- 
zontale ist: 


(12) 


Aus diesen berechnet sich der horizontale Winkelabstand der 
Linie vom Durchstosspunkt am Film 


c= 90°— a ao p. i (13) 
Der vertikale Abstand ist durch die Schichtlinienbeziehung ge- 
geben. 


HI. Bestimmung des Elementarkörpers und der Translationsgruppe. 
Die Strukturbestimmung erfolgte in drei Schritten. Zunächst 
wurden durch Drehdiagramme um die Hauptachsen die Identitäts- 
perioden in diesen Richtungen, d. h. die Grösse des Elementarkörpers 
festgestellt. 
In den nachfolgenden Tabellen sind die den verschiedenen Schicht- 
linien entsprechenden doppelten Schichtlinienabstände unter 2e, die ctg 


des Öffnungswinkels des Schichtlinienkegels unter L, der cos dieses 


Winkels unter cosu und die daraus berechneten Identitätsperioden 
unter J aufgetragen. 
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Das Diagramm der Tabelle 2 ist in Fig. 4 abgebildet. 


Tabelle 2. 
Drehdiagramm um [001] (Fig. 4). 
Nr 2e EA cos u J | Mittel 
r i 
| | 
I 9.5 0-0633 | 0-0633 24-48 
II 19-5 0.1300 | 0.127 24.62 
II 29.5 0-1968 | 0-192 24.40 24-56 
IV | 40.0 0.2670 | 0-259 24.60 ORs 
vV 49.5 0.330 | 0.313 24-63 
VI | 64.0 0.406 | 0875 | 24.65 | 


: Fig. 4. 
Drehdiagramm um die c-Achse mit Cu-Strahlung aufgenommen. 


Tabelle 3. 
Drehdiagramm um [010]. 
Nr. 2e — | cos u | J | Mittel 
I | 18-0 0-120 0-119 12-92 
II 37-0 0.246 0.238 . 12.88 12.87 


III 57-0 0-380 0-360 12-80 
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Tabelle 4. 
Drehdiagramm um [100). 
Nr 2e | ee cos u J | Mittel 
r 

I 22.0 0.147 0.144 10-62 

II 45.0 0-300 0-288 10.62 10.61 
II 66-5 0-443 0-400 10-60 


Das röntgenographisch gefundene Achsenverhältnis ist somit 
0-820:1:1-900; es stimmt mit dem goniometrischen 0-8109:1:1.9005 1) 
hinreichend überein. Wie man sieht, sind die Achsen genau doppelt 
so gross als die von Bracg gefundenen, was seine experimentelle Be- 
gründung dadurch erfährt, dass auf unseren Diagrammen an den Stellen 
der ungeraden Schichtlinien sicher Linien nachweisbar waren. Einige 
solche Reflexionen sieht man z. B. in Fig. 4, sie sind dort durch ein 
x hervorgehoben. 

Die einzelnen Schichtlinienabstände konnten nicht leicht ver- 
messen werden, da keine Punkte, sondern Linien entstanden. Es musste 
also der Abstand zwischen den Mittelpunkten dieser Linien abgeschätzt 
werden. Zur grösseren Sicherheit wurden noch andere Diagramme 
um dieselben Richtungen aufgenommen, bei welchen aus dem ein- 
fallenden Strahl durch einen horizontalen Schlitz ein niedriger Bereich 
ausgeblendet, so dass die Interferenzen beinahe punktförmig und die 
Schichtlinienabstände gut vermessbar wurden. Auch durch Dreh- 
diagramme mit kleinen Kriställchen wurden diese Identitätsperioden 
überprüft. 

Aus der Dichte des Schwefels 2.0372) ergab sich, dass im Elemen- 
tarkörper ; 
10.61 . 12.87 . 24.56 . 1022. 6.06 - 1023 

32.07 =. 


das ist etwa 129 Atome, enthalten sind. Diese Zahl ist später dadurch 
verbessert worden, dass die (001)-Ebene — wie wir sehen werden — 
erst in 8. Ordnung auftritt, was einer achtfachen Unterteilung der 
c-Achse entspricht. Die Anzahl der Teilchen im Elementarkörper muss 
also durch 8 teilbar sein, was die Zahl 128 festlegt. | 

Nach derselben Methode wurden durch Drehung um [011], [101], 
[110] die Identitätsabstände in der Richtung der Flächendiagonalen 


1) P..v. Groth, Chem. Kristallogr. 1, 26. 
2) P.v. Groth, loc. cit. 
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bestimmt. Es ergab sich, dass der Elementarkörper allseitig flächen- 
zentriert ist, da die bezüglichen Abstände halb so gross sind, als die 
aus den Achsen berechneten. Ein Diagramm um [110], welches mit 
MoK „-Strahlung aufgenommen wurde, ist in Tabelle 5, ein mit Cu- 
Strahlung aufgenommenes in Tabelle 6 wiedergegeben. 


Tabelle 5. 
Drehdiagramm um 110. 
Nr. 2e aa cosu J | Mittel 
r 

I 4.8 0.0839 0.0839 8-32 

II 9.7 0-170 0.168 8-30 8.35 
II 14-4 0.252 0.245 8-44 Í 
IV 19-0 0-332 0-315 8-34 


In ähnlicher Weise wurde mit Kupferstrahlung für die Identitäts- 
perioden in der Richtung [101] 13-22, in der Richtung [011] 13-93 
gefunden, während die aus den Achsen berechneten Längen 26-60 
bzw. 27.7 sind. (Für [110] findet man aus den Achsen 16-38.) 

Eine Drehung um die Raumdiagonale [111] bewies, — wie ja vor- 
auszusehen war —, dass der Elementarkörper nicht raumzentriert ist; 
(gefunden 29.25, berechnet 29.6). 

Nun wurden die Linien in sämtlichen Diagrammen identifiziert; 
zunächst auf dem Äquator. In den folgenden Tabellen sind bei beob. 
bzw. ber. die beobachteten bzw. berechneten Werte der Winkelent- 
fernungen vom Durchstosspunkt eingetragen. Der Durchstosspunkt 
wurde von einer Linie aus genau bestimmt, die dann in Klammern ein- 
gesetzt ist. Wie auf S. 402, Formel (1) abgeleitet, ist der Winkel- 
abstand am Aquator 


Na 
p =e + 9? (1) 
wo z einfach den halben Glanzwinkel und ¿ den Winkel zwischen der 
Ebene, die zur Reflexion kam, und der „eingestellten Ebene“ bedeutet. 


Der Winkel 7 wurde nach der bekannten quadratischen Form be- 


stimmt und zwar 


sin? > — 0.005255 h? +- 0.003585 k2 +- 0.000986 12. 


ine nahen Te Ka an gen nn me a a nen ia an a a 
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| Tabelle 6. 

Aquator bei der Drehachse [110]. Eingestellte Ebene (001). (Fig. 5). 
Indizierung Ä (008) , | (0016),, | (008) » | (0016), (1111, 
beob. (14-5) 30.2 | 13-1 27.0 5.5 
ber. (14-5) 30.2 13-1 26-9 5-5 

Indizierung | (1111) | (1113), | (1115), | (2 2 20), 
beob. | 3-5 11-7 17.4 19.3 
ber. | 3-5 11-7 17.4 19-4 

Fig. ða 


\ l ji 


Yay j u W J Wo VA 
vA Ay \ 
nal i y Yu v 


Fig. Sb. | 
Drehdiagramm um die [110)-Richtung mit Cu-Strahlung. Eingestellte Ebene war (001). 


Eine zur sicheren Vermessung und quantitativeu Intensitätsverwer- 
tung angefertigte Photometrierung dieses Diagrammes befindet sich 
unter 5b; sie ist im bezug auf das Originaldiagramm seitenverkehrt. 
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Tabelle 7. 

Äquator bei der Drehung um [110]. Eingestellte Ebene (111). (Fig. 6.) 
Indizierung | (111), z (222), 7 (333), l (444), | (555). 
beob. 5-7 a 17-55 23-7 29.7 
ber. 5-7 17-4 23-5 29-8 
Indizierung | (553), E (331), = (442), — (111), | (222), 
beob. 21-9 4.55 5-15 | 10-4 
ber. 21-7 4.5 51 | 1083 

Indizierung | (333) | (331), = (442), 


Fig. 6a. 


U un ED U 


Fig. 6b. 
Drehdiagramm um die [110]-Richtung mit Cu-Strahlung. 


Eingestellte Ebene war (111). 


Die Photometrierung befindet sich unter 6b. Bei dieser Aufnahme 
konnte der Nullpunkt — unabhängig von den Linien — aus der Br- 


und Ag-Absorptionskante bestimmt werden. 


Des weiteren sei die Auswertung einer Schichtlinie mitgeteilt. 
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| Tabelle 8. 
Schichtlinie bei der Drehachse [110]. Eingestellte Ebene (001). (Fig. 6). 
Indizierung | (119) | (119) | (1111) | (1113) (1115) 
beob. 11-45 18-55 16-4 25-4 29.4 
ber. | 11-4 | 18-6 16-2 25-4 29-4 
Indizierung | (028) | (0 2 10) | (2 0 10) 
beob. 6-6 | 31-95 8 
ber. | 6-5 32.05 8.2 


Die $-Linien sind hier nicht angeführt, da sie von den a-Linien 
durch ihre etwas tiefere Lage leicht zu unterscheiden sind. Die Lage 
des Nullpunktes war vom Aquator her bekannt. 

Auf dieselbe Weise wurden sämtliche in den 7 Drehdiagrammen 
auftretenden Linien indiziert. 


IV. Bestimmung der Raumgruppe. 


Der Statistik der auftretenden Netzebenen entnehmen wir zunächst 

für die Raumgruppenbestimmung folgende Tatsachen: 
1. Bipyramiden mit lauter ungeraden Indizes besitzen nor- 
males R; (111), (335), (1111), (1113) dagegen treten (112), 

(211) usw. nicht auf. 

2. Die Röntgenperioden des vorderen und seitlichen Pinakoids 
sind viertelprimitiv; (001) tritt erst in achter Ordnung auf. 
Die rhombisch bipyramidale Klasse enthält 28 Raumgruppen, von 
denen VŽ — V?’ durch die Feststellung 1. streng ausgeschlossen werden, 
denn es sind Ebenen beobachtet worden {(111) usw.}, welche in diesen 
Gruppen nicht auftreten dürften. Die übrigen Gruppen Vi — Vj bleiben 
zur Diskussion stehen. Nimmt man aber den in der Tabelle 5 und 
in dem nachfolgenden Absatz enthaltenen Befund hinzu, dass die Iden- 
titätsperioden auf den Flächendiagonalen die halbe Länge der Hypo- 
tenusen besitzen, so gelangt man zu einem allseitig flachenzentrierten 
Elementarkörper. Dieses Ergebnis steht mit der Feststellung 1. in 
völliger Übereinstimmung, denn die experimentell gefundenen Aus- 
löschungen sind gerade diejenigen, welche von den beiden allseitig 
flächenzentrierten Raumgruppen gefordert werden. Es sind nunmehr 
V? und V% übrig. Zwischen diesen liesse sich streng unterscheiden, 
wenn man ein Pinakoid in zweiter Ordnung beobachtet hätte. Dies 
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ist aber nicht der Fall; vielmehr sind zwei Pinakoide erst in vierter, 
das dritte in achter Ordnung vorhanden. Auf Grund dieser Erschei- 
nung kann man zwar zu der Aussage kommen, dass die Gruppe V7} 
die gefundenen Verhältnisse am besten wiedergibt; es ist aber nicht 
möglich V} streng auszuschliessen, weil die Auslöschung von (200), 
(020) und (002) eine durch die Freiheitsgrade der Struktur bedingte 
sein könnte. Auch wäre es möglich, dass bei besonders langen Expo- 
sitionen doch Andeutungen dieser Ebenen auftreten könnten. Da wir 
aber auf verschiedenen Diagrammen von sehr hoch indizierten Ebenen 
[z. B. (2, 2, 20)]die a-, œ'-, p- und y-Linien noch sicher feststellen konnten, 
so ist die Auslöschung jedenfalls eine sehr weitgehende und es sind 
keinerlei experimentelle Anhaltspunkte für eine gegenteilige Auffassung 
vorhanden. Demgemäss wollen wir bei der Lokalisierung der 
Punktlagen nur die Gruppe V;' diskutieren. 

Unter der Annahme, dass Schwefel bisphenoidisch kristallisiert, 
ergibt sich nur die allseitig flächenzentrierte Gruppe V”. 

In 77‘ sind nun 128 S-Atome zu lokalisieren. Der einfachste 
Fall ist der, dass vier verschiedene S-Atome da sind, die sich in all- 
gemeinen Punktlagen befinden. Wenn es mehr als vier verschieden- 
artige S-Atome gibt, kommt noch eine Reihe von Möglichkeiten hinzu, 
bei welchen sowohl in allgemeinen als auch in speziellen Lagen 
Atome zu lokalisieren sind.. Wir wollen im folgenden nur den Fall 
von vier ungleichwertigen S-Atomen diskutieren. 

Es sind also vier konstituierende Punkte S,, S, S, und S, zu 
lokalisieren. Die Koordinaten für eine allgemeine Punktlage sind‘): 


Im, n, pl] [m +-4, n+ 4, pl [m+ $, n p+ 4] Im, nr +4, p+ 
im, n, p) [m+ 4, n+4, pl Rt fm, 7+4, p+ 
[m, n, p) [m+ 4, n p) [m+4,,p+4] (m, n+4, PA 
m, n, p] [m+ 4, n+ 4, P) [m+ 4, %, p+ 4] Im, a+3, p+ 4] 
[mti At ptt] mtt ntt PHH 
mtt a+ ptt mtt ntt p+4] 
[m+ 4 a+ P mh, n, Pl 
m+ a+ pti mai nti pt 
[m+ Att pHi imt, ntt P+ 
mtr tn ai ntt pll 
ma +t, A+ PHa mtt nt, +H 
[m+4, 7+3, p +3] m 4t, n+, p il 
1) P. Niggli, Geometr. Kristallogr. d. Diskontinuums, Leipzig 1919, S. 215. 
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Die Struktur besitzt also die Freiheitsgrade 
Mi, My, Pr, Ma, Me, Pa, Mg, Ng, Ps, M4, M4, Pay 


Ss 4 
3’ a? & 
die Verhältnisse der Reflexionsvermögen. Diese Verhältnisse kann man 
mit grosser Wahrscheinlichkeit gleich Eins setzen, da einerseits hetero- 
polare Bindungen zwischen S-Atomen wohl nicht in Frage kommen 
und andererseits Auslöschungen beobachtet wurden, welche durch die 
Raumgruppe allein nicht bedingt sind, so dass rationale Unterteilungen 
durch Ebenen vorkommen müssen, welche sich bei gleicher Atomzahl 
pro A? nur dadurch unterscheiden, dass die einen Ebenen z. B. S,-, 
die anderen S,-Atome enthalten; wären die Reflexionsvermögen ver- 
schieden, so müsste durch ihre Verschiedenheit die irrationale Unter- 
teilung zufällig gerade kompensiert werden. Nun wurden aber diese 
Auslöschungen auch noch mit verschiedenen Wellengängen festgestellt, 
so dass nicht nur die Reflexionsvermögen selbst, sondern auch ihre 
Abhängigkeit von der Wellenlänge so geartet sein müsste, dass in 
jedem der Fälle gerade Kompensation des Parameters eintritt. Da dies 
aber sehr unwahrscheinlich ist, ist es wohl berechtigt, diese Verhält- 
nisse gleich 1 zu setzen. Es bleiben somit noch 12 Parameter zu 
bestimmen, wofür der Strukturfaktor herangezogen werden muss. Da 
eine gleichzeitige Bestimmung sämtlicher Unbekannten ausserordentlich 
kompliziert wird, sei diese zunächst nur für den p-Parameter durch- 
geführt. Die quantitative Vermessung der Intensitäten von (004), 
(008), (0012), (0016) und (0024) bei Sr-Strahlung (Ar, = 0-87 A, 
Akg = 0.78 A) und Cu-Strahlung (Ar, = 1-54, Akg = 1.39) zeigt Tabelle 9. 
Der Strukturfaktor fiir (004) z. B. lautet: 


Fro, = S; {4 e8ripı J de Srim ei 4edrim+H L 4e—8nilp—+) 
+ 4¢e8%tm+h 4 4e 8ail — t L geript 
a 4e—Srilpı—3)) 4 DlA enipe RER ES: fa 
+ 8, tees} 
= 64 (S, cos 8719 + S, cos 8 mip, + Sy cos 8rrpty 
+ S, cos 8 mip,). 
Wenn man 


und 


Q = & =, =S 
setzt, erhält man hieraus 
Foo, = 64 S (cos 8 wip, + cos 8 mip, + cos 8 rip; + cos8rin,). 


The Lattice Structure of Rhombic Sulfur 41 


Cu-Strahlung Sr-Strahlung 


Die Gitterstruktur des rhombischen Schwefels. 413 
Tabelle 9. 
Indizierung Intensität mit Intensitat mit 


0 


(004) | 0 
008) 15 | 12 
(00 12) 0 | 0 
(0.0 16) 28 30 
(0. 0.20) 0 
(0.0 24) 8 
(0.0 28) 0 


Aus der ersten und dritten dieser Intensitäten folgen bereits die fünfte 
und siebente, wenn man — wie wir es getan haben — annimmt, dass 
die Reflexionsvermögen der ungleichwertigen S-Atome gleich sind. 
Berechnet man aus den vier ersten Gleichungen p, bis p,, so kommt 
man zu folgenden Werten: 

pı ~ 0.017, 

Po ~ 0.142, 

p ~ 0-047 , 

9, ~ 0.172. 


Leider stehen keine weiteren Reflexionen zur Verfügung, um die Zu- 
verlässigkeit der p-Werte zu prüfen; sie können daher nur als ein ganz 
versuchsweiser Ansatz gewertet werden. Denn abgesehen davon, dass 
keine Ebene als Kontrolle benutzt werden konnte, muss auch 
darauf hingewiesen werden, dass bei der verwendeten experimentellen 
Methode die theoretischen Grundlagen der für die Intensität gebrauchten 
Formel keineswegs einwandfrei sind, da weder die Bohrsche Absorp- 
tion noch die durch die Reflexion bedingte darin zum Ausdruck 
kommen. Wir sind damit beschäftigt, durch Verwendung härterer 
Strahlung mehr Ebenen zur Reflexion zu bringen und dann unter Be- 
rücksichtigung der Absorptionsverhältnisse aus Drehdiagrammen um 
[100] und [010] auch die Werte von m und n gesondert zu bestimmen. 

Um über die Möglichkeit, in dem gefundenen Gitter abgeschlossene 
Gruppen erkennen zu können, etwas zu erfahren, wollen wir noch 
kurz das Reis-Weissenbergsche Zusammengehörigkeitskriterium 
darauf anwenden. Hierbei ergibt sich, dass sich höchstens Gruppen 
von 16 S-Atomen zusammenfassen lassen. Das Gitter, welches die 
Schwerpunkte dieser Gruppen bilden, besteht aus zwei um 4 4 1 gegen- 
einander verschobenen flächenzentrierten Komplexen; ist also das „rhom- 
bische Diamantgitter“. 
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Zusammenfassung. 


Rhombischer Schwefel kristallisiert rhombisch bipyramidal mit den 
Achsen: 


a = 10.61 Å, 
b = 12.874, 
c = 24.56 A | 


in der Raumgruppe V;*; der allseitig flächenzentrierte Elementar- 
körper enthält 128 Atome. In diesem Gitter lässt sich eine Gruppe 
von héchstens 16 S-Atomen geometrisch zusammenfassen, deren Schwer- 
punkte ein rhombisches Diamantgitter bilden. 


Bildung und Zerfall von Molekülen 
M. Polanyi und E.P. Wigner 


Zeitschrift fur Physik 33, 429-434 (1925) 


(Eingegangen am 27. Juni 1925.) 


Diskussion der Annahme spontanen Zerfalles und einfacher Bildung gequantelter 
Systeme auf Grund der Annahme endlich breiter Quantenniveaus. 


1. In ihrer Arbeit über das obige Thema gehen Born und 
Franck!) davon aus, daß weder eine Bildung eines Moleküls durch 
einfache Assoziation, noch der zugehörige Gegenprozeß, nämlich der 
spontane Zerfall eines Moleküls, vor sich gehen kann. Als Begründung 
wird angeführt, daß es unendlich unwahrscheinlich ist, daß die Relativ- 
energie der zusammenstoßenden Gebilde gerade einen quantenhaft zu- 
lassigen Energiewert des zu bildenden Moleküls beträgt. 

Ein abweichendes Verhalten nehmen Born und Franck an, wenn 
das eine Gebilde zu einem festen Körper ausartet. An einem solchen 
kann ein Atom oder ein Molekül ohne weiteres haften bleiben, weil hier 
schon eine kontinuierliche Mannigfaltigkeit von Energiewerten zulässig 
sein soll. 

Die vorliegende Untersuchung befaßt sich im Gegensatz zu Born 
und Franck damit, die Molekülbildung durch einfache Assoziation und 
den spontanen Zerfall derselben zu diskutieren. Wir gehen dabei von 
der Vorstellung aus, daß die Quantenzustände eine endliche Breite 
haben?) und es daher eine endliche — wenn auch zuweilen geringe — 
Wahrscheinlichkeit dafür gibt, daß zusammenstoßende Moleküle mit 
quantenhaft zulässiger Energie zu einem neuen Molekül zusammentreten. 

2. Das Vorkommen solcher Vorgänge sehen wir bereits durch das 
vorhin erwährte Haften von Molekülen an festen Wänden als erwiesen 
an. Wir stützen uns ferner darauf, daß kürzlich von Auger?) ein Bei- 
spiel für den spontanen Zerfall eines gequantelten Systems geliefert 
worden ist. Er beobachtete, daß ein Atom, dem durch Röntgenstrahlen 
ein K-Elektron entrissen wurde, ein zweites Elektron emittiert. Das 


1) Ann. d. Phys. 76, 225, 1925. 
2) Vgl. z.B. N. Bohr, ZS. f. Phys. 18, 117, 1923 (8. 150 ff.) 
3) C. R. 180, 65, 1925. Vgl. auch L. Meitner, ZS. f. Phys. 17, 16, 1923. 
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Elektron stammt aus der Z-Schale, die Energie wird durch Übergang 
eines zweiten L-Elektrons in die A-Schale geliefert. Aus älteren Ver- 
suchen von Barkla über die Ausbeute der Röntgenfluoreszenz schließt 
Bothe?), daß bei Br,, wo die genauesten Messungen vorliegen, die 
Hälfte der ionisierten Atome die A-Linie emittieren, die andere Hälfte 
dagegen auf die von Auger angegebene Weise zerfallt. Es zeigt sich 
also, daß die Wahrscheinlichkeit des Zerfalles hier gleich der Aus- 
strahlungswahrscheinlichkeit ist. 

Wir wollen nun zeigen, auf welche Weise die Elementarprozesse 
bei einem spontanen Zerfall anzunehmen sind, um ein der Thermodynamik 
entsprechendes Gleichgewicht zu gewährleisten. Als Beispiel möge der 
vorhin erwähnte Fall des im K-Ring ionisierten Atoms dienen. Für die 
Konzentrationen c, der doppelt geladenen Ionen, c, der Elektronen und 
c der einfach geladenen Ionen gilt die thermodynamische Beziehung 


CC = = (xuk T)? 
fs Ar Ban a) 


worin Q die lonisierungsarbeit und 
o mh 
am m, + m, 
bedeutet. (m, Masse des doppelt geladenen Ions, m, Masse des Elektrons.) 

Pro Sekunde sollen cfr einfache Ionen in ein doppelt geladenes und 
ein Elektron zerfallen. 

Anderseits soll bei einem „Zusammentreffen“ von einem Elektron 
und doppelten Ion eine Rekombination erfolgen, falls die Energie der 
Relativbewegung in den quantenhaft erlaubten Energiebereich von der 
Breite 4 e fällt. Ein Zusammentreffen soll zustande kommen, wenn der 
Bahnenabstand kleiner als d ist. Die Anzahl der Rekombinationen ist 
hiernach 


VE 2 -2 1% 
“1% ET) Ya © 72) 


Und als Bedingung des stationären Zustandes hat man 


c Q 
= ua ET 2 
= = ann vo Va e rae ŽS d* x 
Mit (1) kombiniert ergibt dies 

h3 


> — z ‘ 
az Sm’Qude | 2 


1) Phys. ZS. 26, 410, 1925. 
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Für rge setzen wir nun in Übereinstimmung mit der Theorie der 
Linienbreite!) h. Dann folgt 


h 
Rn = z> (3) 


Anderseits ist für eine Rekombination, bei der der Bahnabstand 0 (< d) 
ist, der gegenseitige Drehimpuls gleich OY2Qp. Gleichung (3) besagt 
also, daß dieser Drehimpuls kleiner sein muß als h/2 x. — 


Ein ähnliches Bild wollen wir in der Folge auch der Molekülbildung 
zugrunde legen. | 


Es muß hervorgehoben werden, daß hierdurch der Drehimpulssatz 
bei Rekombinationen aufgehoben wird. Die Quantentheorie erlaubt für 
den Drehimpuls des entstandenen Gebildes nur ganzzahlige Vielfache 
von h/2x. Die vorgegebenen, nicht gequantelten Drehimpulse der sich 
vereinigenden Körper müssen daher auf das nächstliegende Vielfache auf- 
gefüllt werden. Unter dieser Annahme ist die Gültigkeit der Reaktions- 
isochore in allen Fällen gewährleistet, worauf wir uns in der Folge 
stützen werden. 


3. Wir betrachten zwei Atome A und B, die ein Molekül AB bilden 
können. Wenn A und B mit der Relativenergie ¢ sich vereinigen, so 
wird das entstandene Molekül die innere Energie ¢ + Q (Q = Disso- 
ziationswarme) haben. Die Konzentration c’ solcher AB-Moleküle, die 
eine höhere innere Energie als Q haben, im Gleichgewicht mit Atomen 
von der Konzentration c, und c, ist durch die Gleichung gegeben 


CCa _(@2zukT) WM ko 
co h? 8mJkT Q 
Dabei sind die Schwingungen mit der charakteristischen Temperatur ® 


als harmonisch, das Trägheitsmoment J als unveränderlich angenommen °’). 
Bei Annahme einer Lebensdauer r zerfallen pro Sekunde 


de 6,6, Bk VahT Q 4) 
Tt T k VTO u’! ( 

Moleküle AB. Im Gleichgewicht werden ebenso viele gebildet, also ist 
die Rekombinationsgeschwindigkeit durch die rechte Seite der Gleichung 


1) Vgl. N. Bohr, l. c. 
2) Diese Annahmen bieten sicher nur eine grobe Annäherung, doch sollen sie 
hier verwendet werden, weil eine Grundlage für ihre Berichtigung zurzeit fehlt. 
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gegeben. Die Geschwindigkeitskonstante x dieser Elementarreaktion 


hat demnach den Wert 
2Y2ahJQ 1 
ee 2Y2zhJQ 1 (5) 
k'l VT @ pile t 


Dieselbe Formel gewinnt man gemäß Punkt 2 auch, wenn man an- 
nimmt, daß die Rekombination zu einem Molekül mit der inneren 


h ; 
Energie ¢ und dem Drehimpuls = erfolgt, sobald die Atome die Relativ- 


2% 
energie zwischen € — Q und € + fe — Q und den Drehimpuls zwischen 
l 
(n — 1) a und = besitzen. Dabei hängt Je durch die Gleichung 
T 
Aet = h mit t zusammen. 


Den Vorgang der Dissoziation müßte man sich auf Grund des ver- 
wendeten Bildes in zwei Schritten vorstellen. 1. Bildung eines Moleküls, 
dessen innere Energie größer als Q ist (durch Zusammenstoß oder 
Strahlungsabsorption). 2. Spontaner Zerfall dieses Moleküls. 


Der zweite Prozeß wird nur dann erfolgen, wenn das energiereiche 
Molekül nicht vorzeitig seine Energie wieder verliert. Umgekehrt 
verläuft die Molekülbildung so, daß die zunächst entstehenden energie- 
reichen AB-Moleküle durch weitere Zusammenstöße oder Ausstrahlung 
stabilisiert werden. Die Wahrscheinlichkeit für die Bildung energie- 
reicher AB-Moleküle haben wir bereits angegeben. Von den x vor- 
handenen werden, wenn wir von der Strahlung absehen, xS in der 
Zeiteinheit stabilisiert, wo S die vom Gesamtdruck abhängige Stoßzahl ist. 
Anderseits zerfallen pro Sekunde x/r. Im stationären Zustande ist: 


x 
CCa% = au x 8. 


Die Reaktionsgeschwindigkeit wird durch die Anzahl der pro Zeiteinheit 
stabilisierten Moleküle gegeben. Daher ist die Geschwindigkeitskonstante 
der Gesamtreaktion X 

“Ss 


BAR 
T 
Bei kleinen Drucken geht diese Gleichung (da T S wird) über in 
T 
2V2 x 
kebr LEER (6 a) 


~ PeVT Op — 
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Dé: lichen Da a (da 2 s) 
T 


2V2ahJQ 1 
7:22 VFO T 


Die Größenordnung der Lebensdauer z bestimmen wir aus den Versuchen 


VE ee (6 b) 


von Bodenstein und Lütkemeyer!) über die Geschwindigkeit der 
Reaktion 2Br— Br,. Wir führen dementsprechend ein 


K = 10138 ccm/Mol. sec — 3,65. 10-13cem/Atom see, 
ferner log J = — 37,277) und @ = 510°) und erhalten t = 3,5. 10-9 
Es steht hiermit in Übereinstimmung, daß die Reaktion zwischen den 
Drucken von 0,3 und 1 Atm. druckunabhängig verlief, da 2 hier etwa 


tS 
0,3 bis 0,1 beträgt. 
Aus den Messungen der Lebensdauer des aktiven Wasserstoffs 
von Wood‘) und Bonhoeffer’) folgt für die Reaktion 2H—H, 


log K = — 15. Aus Gleichung (6a) berechnet sich für H, (mit 
log J = — 40,70 = 5000) °%) für kleine Drucke (0,5 mm) 
logx = — 16,3. Dies ist kleiner als der gemessene Wert, was auf 


Wandreaktion beruhen kann. 
Handelt es sich um die Entstehung heteropolarer Verbindungen, so 
muB die Ausstrahlung mit berücksichtigt werden. Die Wahrscheinlichkeit 
dafür, daß das Molekül AB den zur Stabilisierung nötigen Strahlungs- 
verlust von & erleidet, sei 1/7’ pro Zeiteinheit. 
Dann gilt (wenn wir nun die Stabilisierung durch Stöße vernach- 
lässigen) 
tc 2V2xhJQ 1 


= THT eo l;?l2 VTO w°i2 t+ T (4) 


Eine Stabilisierung durch Strahlung ist anzunehmen bei der Bildung 
von NaJ aus den Atomen, die H. Beutler mit einem von uns’) 


1) ZS. f. phys. Chem. 114, 208, 1924. 

2) Berechnet aus der chemischen Konstante des Br,-Molekiils i — 2,546 
(Lüde und Suhrmann, ZS. f. Phys. 29, 71, 1924). 

3) Eucken und Fried, ZS. f. Phys. 29, 36, 1924. | 

4) Phil. Mag. (6) 42, 729, 1921; (6) Zu 538, 1922. Proc. Roy. Soc. (A) 97, 
455, 1921; 102, 1, 1922. 

5) ZS. f. Diye: Chem. 119, 199, 1924. 

6) Eucken und Fried, 1 c. 

7 H. Beutler und M. Polanyi, Die Naturwiss. 1925. 
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studiert. hat. Die Versuche können gedeutet werden mit einer Lebens- | 
dauer. r von 3.10-° und rt’ wr. 

4. Neben der einfachen Rekombination kann die Molekülbildung 
auch durch Dreierstöße vor sich gehen. Die Geschwindigkeit einer solchen 
Reaktion ist, wenn man die Dauer eines Stoßes zu 10-1?sec annimmt t), 

8? 
Cgesamt 
Wie ein Vergleich mit Gleichung (6) zeigt, bleibt die Geschwindigkeit 
der Dreierstoßreaktion fast immer unterhalb derjenigen des oben dis- 
kutierten Reaktionsweges. Natürlich ist hierin die wesentliche Annahme 
enthalten, daß die Querschnitte gastheoretisch berechenbar sind. 


cc - 10-12, 


1 2 


1) K. F. Herzfeld, ZS. f. Phys. 8, 132, 1922. 


Über die Interferenz von Eigenschwingungen 
als Ursache von Energieschwankungen 
und chemischer Umsetzungen 


M. Polanyi und E. P. Wigner 


Zeitschrift für physikalische Chemie A (Haber-Band) 43, 439-452 (1928) 


(Eingegangen am 10. 10. 28.) 


Inhaltsangabe. 


Nach alten Erfahrungen enthält die Konstante der Geschwindigkeit mono- 
molekularer Reaktionen in Lösungen und, wie sich neuerdings zeigt, auch in Gasen, 
neben einem exponentiellen Faktor, eine Grösse, die stets von etwa der gleichen 
Grössenordnung 1014 ist. Um dies zu erklären stellen wir uns vor, dass die Energie 
im Molekül in elastischen Wellen hin und her schwankt und die Umsetzung dann 
eintritt, wenn sich zufällig durch Interferenz der Wellen die Amplitude an einer 
bestimmten Bindung über einen kritischen Betrag erhöht. Das ergibt für die Ge- 


schwindigkeitskonstante ve ET , worin » die Frequenz der Atomschwingung ist, 
also die richtige Grössenordnung hat. 


1. Problemstellung. 


Unsere Arbeit ist darauf gerichtet, eine Vorstellung über den 
Mechanismus monomolekular verlaufender Reaktionen (Dissoziation 
und Umwandlung) grösserer Moleküle zu gewinnen. Die monomole- 
kularen Gasreaktionen sind in letzter Zeit vielfach diskutiert worden, 
doch gehen die Arbeiten (die später noch berührt werden sollen) auf 
den Mechanismus innenmolekularer Vorgänge nicht ein, sondern be- 
nutzen zu deren Kennzeichnung eine Konstante, die mit keiner an- 
deren Erscheinurg in Zusammenhang gebracht wird. Unsere Vor- 
stellung soll zur Deutung dieser Konstante führen. Sie geht davon 
aus, dass das System aus einer Anzahl von Atomen besteht, die durch 
quasielastische Kräfte an Gleichgewichtslagen gebunden sind. Die 
möglichen Bewegungen solcher Systeme lassen sich aus Eigenschwin- 
gungen zusammensetzen, die durch Interferenz (Schwebung) zuweilen 
an einzelnen Stellen eine erhebliche Amplitude erzeugen können. 
Wenn man z. B. vier in einer Geraden angeordnete Atome hat, so hat 
das System drei Eigenschwingungen. Wenn nur eine dieser Eigen- 
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schwingungen angeregt ist, so ist die Amplitude aller Atome zeitlich 
konstant und von gleicher Grössenordnung. Sind dagegen etwa alle 
drei Eigenschwingungen erregt, so werden die Schwebungen zur Folge 
haben, dass jede einzelne Bindung zeitweise den grössten Teil der 
Schwingungsenergie aufnimmt. Hierbei wird. eine Umwandlung ein- 
treten können, indem eine Konfiguration entsteht, die beim Abfluss 
der Energie nicht stets in den Ausgangszustand zurückkehrt!), son- 
dern mit etwa gleicher Wahrscheinlichkeit in eine neue Gleichgewichts- 
lage übergeht. Auch eine Dissoziation kann auf diese Weise eintreten, 
wenn die kritische Elongation die Reichweite der molekularen Kräfte 
erreicht. Von der Erörterung sind gemäss der obigen Vorstellung Bil- 
dung und Zerfall zweiatomiger Moleküle im Gasraum von vornherein 
ausgeschlossen. Dieser Vorgang war Gegenstand einer früheren Notiz 
der Verfasser?). Er wurde in Analogie zum Augerprozess gebracht. 
Kurz nachdem K. F. BONHOEFFER und L. FARKAS?) nachgewiesen 
hatten, dass der photochemische NH,-Zerfall einen dem Augerprozess 
analogen Vorgang darstellt, ist diese Frage mit Hilfe der neuen Quan- 
tenmechanik von R. DE L. KRoNIG?) angeschnitten und als typisch 
quantenmechanischer Effekt gekennzeichnet worden. Die Geschwin- 
digkeit von. Vorgängen, die durch diesen Effekt bedingt sind, ist 
wesentlich geringer als die der im Text betrachteten Reaktionen. 

Eine Vernachlässigung ist die Annahme quasielastischer Bin- 
dungen, womit die Streuung der elastischen Wellen und damit die 
Überlagerung der gewöhnlichen Warmeleitung®) wegfällt. Auch ist 
die Zuständigkeit unserer Ableitung voraussetzungsgemäss auf Um- 
wandlungen beschränkt, die mit keinem Elektronensprung verbun- 
den sind. 

Eine weitere Einschränkung ist durch die Verwendung der A 
schen Vorstellungen an Stelle der quantenmechanischen bedingt. Da 


1) Es war bisher üblich, den Reaktionsvorgang in zwei verschiedenartige Teile 
zu zerlegen: die „Aktivierung“ und die „Umwandlung“. Die „Aktivierung“ soll 
durch intramolekulare Energieübertragung beliebig schnell erfolgen, die ,, Umwand- 
lung“ dagegen spontan gemäss einer gewissen, die Reaktionsgeschwindigkeit we- 
sentlich bestimmenden Lebensdauer des aktivierten Zustands. Diese Auffassung 
lehnen wir ab, weil zufolge der Umkehrbarkeit der Elementarprozesse jener Vor- 
gang, der für die eine Reaktionsrichtung „Aktivierung“ ist, für die Gegenreaktion 
„Umwandlung“ sein muss (unsere eigene Vorstellung entspricht dieser Forderung). 
2) Z. Physik 33, 429. 1925. 3) K. F. BONHOEFFER und L. Farkas, Z. physikal. 
Chem. 134, 337. 1928. 4) R. DE L. Krontia, Z. Physik 50, 347. 1928. 5) Vgl. 
P. DEBYE, Vortrag über die kinetische Theorie der Materie. Leipzig 1914. 
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es sich hier jedoch um Zustände mit hohen Quantenzahlen handelt, 
wird auch die strengere Behandlung mit Hilfe der Quantenmechanik 
kein abweichendes Ergebnis liefern !). Allerdings bringt es die Quanten- 
mechanik mit sich, dass Umwandlungen, die gemäss unserer Vorstellung. 
als möglich erscheinen könnten, es doch nicht sind, weil ausser den 
klassischen Integralen auch noch weitere Integrale der Bewegungs- 
gleichungen existieren, die die Wahrscheinlichkeit gewisser Übergänge, 
z. B. von einem Singulett- in einen Triplettzustand auf ein sehr ge- 
ringes Mass reduzieren?). 

Ausser dem hier beschriebenen Reaktionsmechanismus eröffnet 
die Quantenmechanik noch eine ganz andere Möglichkeit für den Um- 
wandlungsvorgang. Es kann das nichtmechanische Überspringen der 
die beiden Modifikationen trennenden Energiebuckel eintreten, ähn- 
lich, wie dies bei radioaktiven Prozessen der Fall ist®). Ein solcher 
Prozess hätte jedoch einige sonderbare Merkmale, von denen es zu- 
nächst nicht wahrscheinlich erscheint, dass sie auf die bekannten Reak- 
tionen zutreffen. Insbesondere müsste die Reaktionsgeschwindigkeit 
beim absoluten Nullpunkt noch einen endlichen Wert behalten und 


Q 
mit der Temperatur nicht gemäss Ae RT, sondern viel langsamer 
ansteigen. 


1) Im Falle quasielastischer Bindungen kann man — einer Arbeit von 
SCHRÖDINGER folgend — (Die Naturw. 14, 664. 1926) — den Übergang zu hohen 
Quantenzahlen auch im einzelnen durchführen. 

Klassisch lässt sich die Bewegung als Superposition von Eigenschwingungen 
auffassen. Führen wir anstelle der Koordinaten der einzelnen Teilchen » Normal- 
koordinaten« ein, d. h. solche, die bei allen Eigenschwingungen, abgesehen von 
einer konstant bleiben, bei dieser Eigenschwingung dagegen eine reine Sinus- 
bewegung ausführen, so bewegt sich der Bildpunkt des Systems im Konfigurations- 
raum nach der Gleichung | 
qr = a, sin (wg + Ppr) t. (œ) 

Quantenmechanisch bedeutet das Einführen von Normalkoordinaten die Se- 
paration der Variablen. Für jede Normalkoordinate gilt die Gleichung eines 
reinen, harmonischen eindimensionalen Oszillators. In der erwähnten Arbeit von 
E. SCHRÖDINGER ist dann explizite gezeigt, dass die Koordinate q, sich im 
wesentlichen nach der Gleichung («) bewegt. 

Aber auch für nichtquasielastische Kräfte muss die Quantenmechanik bei 
hohen Quantenzahlen mit der klassischen Theorie äquivalente Resultate liefern, 
wenn es auch schwerer sein dürfte, den Beweis im ‘einzelnen durchzuführen. 

2) E. Wiener, Göttinger Nachr. 375, 1927. Nach freundlicher Mitteilung von 
Herrn F. Lonpon ist er mit der Erörterung dieser Seite der Frage beschäftigt. 

3) G. Gamow, Z. Physik 51, 204. 1928. 
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2. Energieschwankungen innerhalb eines festgefügten Systems. 


Wir denken uns ein Atomaggregat, das aus n ungefähr gleich 
schweren Atomen besteht, die von etwa gleich grossen Kräften zu- 
sammengehalten werden. Die rascheste Eigenschwingung habe die 


l s : j ; | : 
Periode PE die Gesamtenergie des Systems sei #. Wir fragen wie 


lange es im Mittel dauern wird, bis an einer hervorgehobenen Bindung 
zwischen .zwei Atomen eine Energie auftritt, deren Betrag e über- 
schreitet. Hierzu fassen wir zunächst die statistische Wahrscheinlich- 
keit. dieses Zustandes ins Auge. Diese ist!): 


wo 1-5) i 


Der Ausdruck geht für grosse Werte von n über in die BoLTZ- 
MANNsche Formel für Stoffe mit der spezifischen Wärme k 


W=e ET, (la) 


wenn man = = kT setzt. 
37 


Andererseits gewinnt man eine Abschätzung der mittleren Dauer t 
eines solchen Zustandes, wenn man die Fortschreitungsgeschwindig- 
keit einer Störung gemäss der Schallgeschwindigkeit v ansetzt: 


eg (2) 


wo d den Abstand der Nachbaratome bedeutet. Für v ergibt sich aus 
der Wellenlänge der raschesten Schwingung, welche d beträgt, gemäss 


vA =v v = vd (3) 
und hieraus Tes a (3 a) 
F | 


In Verbindung mit (1) bzw. (1a) erhalten wir für die gesuchte Zeit Z, 
innerhalb deren es im Mitteleinmal vorkommt, dass die heryorgehobene 
Bindung die Energie € hat 


l £ — 3n +1 

z-— (1-5) (4) 
i! eee 

Z=—e RP. (4a) 


Diese Überlegung ist insofern ungenau, als darin angenommen 
wird, dass die Amplituden benachbarter Atome voneinander stati- 


1) H. A. Lorentz, Theories statistiques en thermodynamique. Leipzig et 
Berlin 1916. 
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stisch unabhängig sind. Doch führt folgender Weg unter Vermeidung 
dieser Annahme zu einem im wesentlichen gleichlautenden Ergebnis. 

Betrachten wir eine Atomreihe von n-Atomen, in dem jedes Atom 
der Einfachheit halber dieselbe Masse m hat. Die Entfernung des k-ten 
Atoms von der Gleichgewichtslage sei x,., die Kraft, die das k-te Atom 
auf das k—1-te ausübt, sei a (&._,—%;,). Dann sind die Bewegungs- 


leichungen $ 
& 5° Map = Q (Ek 41 — Ly) + A (Ek — 1 — Ax). 


Die Kreisfrequenzen w der Eigenschwingungen sind die Lösungen 
der algebraischen Gleichung n-ten Grades 


— mw? -H-a — a 0 0 0 
—& — mow? +a — a 0 0 
0 —a —mw?+2a —a 0 = 0. (5) 
0 0 — a4 —mw? H-2 —a 
0 0 0 —& — moa? + a| 


2 
Führen wir hier für = die neue unbekannte y ein, so lautet (5) 


1-y —1 0 0 0 | 

—1 2—y —] 0 0 
0 —1 2—y — 1 0i = 0. (5a) 
0 0 —] 2—-y — 1 
0 0 0 —1 1—y 


Die Determinante (5a) hat n Zeilen. Für kleine n kann man die 
Lösungen von (5a) explizite berechnen, für grosse n noch leicht (im 
wesentlichen mit Hilfe der 3 2 
DescArTeEsschen Regel) die 
Anzahl der Lösungen in ver- n=: —————_, 
schiedenen Intervallen an- | 
geben!). .Da alle y zwischen 


0 und 4 liegen, können wir die n- s r 
folgenden Intervalle von y be- 


trachten: 0 bis 1, 1 bis 2, 2bis „_. mm — 
3, 3 bis 4. Das Resultat ist in Fie. 1 


Fig. 1 zusammengefasst. Wir 
sehen, dass die Eigenfrequenzen ungefähr gleichmässig auf das ganze 
Intervall verteilt sind. So lange die Massen der einzelnen Atome und 


1) Für das langwellige Gebiet siehe auch P. DEBYE, Ann. Phys. 39, 789. 
1912. Vgl. auch M. Born, Atomtheorie des festen Zustandes. Berlin 1923. Da- 
selbst weitere Literatur. 
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die zwischen ihnen liegenden Kräfte von der gleichen Grössenordnung 

sind, wird sich hieran nichts ändern. Den Fall, dass eine Bindung 

wesentlich schwächer ist, als die übrigen, werden wir weiter unten 

kurz berücksichtigen. Die Entfernung von zwei benachbarten Atomen 

d als Funktion der Zeit ¢ wird die Gestalt 

a nen BET tree 
+ Gy COS Wy (t T Pn) 

haben. Führen wir hierfür die Kreisfrequenz der raschesten Schwin- 


Sms 2. 2 or dr 
gung 2rv ein, so können wir im Intervall — 7 he T mit hin- 


reichender Genauigkeit 


27 N 32 , ” 5-2 ” 
d=d,+ccos—— (t+ 9) +e cos — ¢+go)+c cos ==" tg") 


1.2 sv 


-+ c”’ cos +9") (6) 
schreiben. Ein hoher Wert für d—d, zur Zeit 2=0 wird wegen der 
gleichmässigen Verteilung der Eigenfrequenzen auf das Intervall in 
den allermeisten Fällen so zustande kommen, dass die Koeffizienten 
c, c’, c”, c” schon selber sehr hohe und zwar ungefähr gleich hohe 
Werte haben. Ausserdem müssen die » alle sehr klein, mit anderen 


Worten, die vier Schwingungen von (6) in Phase sein. Wenn d 


besonders hoch ist, so dass ah). >e ist, wird im Mittel 


— d,)2 

ac) N 
2 

sein. Noch grössere Uberschreitungen werden sehr selten sein. Man 


sdy 
überzeugt sich mit Hilfe von (6) leicht, dass der Zustand Zur >eE 


im Mittel, während einer Zeit von der Grössenordnung a 2kT 
y € 
anhalten wird. Da seine statistische Wahrscheinlichkeit kT e ET 
€ 


€ 
ist, wird er pro sec ve kT-mal erreicht werden. 
Bei genauerer rechnerischer Durchführung dieser Überlegung er- 


gab sich 1-15-v-e kT an Stelle von v'e kT. Hierbei ist die in 
Fig. 1 gegebene Verteilung der Eigenschwingungen angenommen, sowie 
dass e gross gegen kT ist. 
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Das Ergebnis gilt den Voraussetzungen gemäss nur, wenn die 
Umwandlung durch die Elongation längs einer einzigen Koordinate 
also etwa durch Längung einer einzelnen Bindung vor sich geht. 
Kann eine Reaktion durch Elongation eines Atoms (oder Radikals) nach 
allen drei Raumrichtungen erfolgen (,,dreidimensionale Umwandlung‘“‘), 


wie dies besonders bei einem mehrfach gebundenen Atom der Fall 
sein kann, so wird die Reaktionsgeschwindigkeit um den Faktor en 
vergrössert. Für dreidimensionale Umwandlungen ergibt sich hiermit 
die Reaktionsgeschwindigkeitskonstante zu 


__ 2» nn 1) 
Au —— pa 
kT 


Bei tiefen Temperaturen sind die Schwingungen nicht sämtlich 
- angeregt. Da hierbei die rascheste Schwingung, die noch angeregt wird, 


eine Frequenz = hat, tritt diese Grösse an Stelle von ». 


Bisher wurde der Fall betrachtet, dass die Festigkeit von allen 
Bindungen von gleicher Grösse ist. Es besteht aber auch das Be- 
dürfnis, sich über Fälle zu orientieren, wo das nicht zutrifft, z. B. 
ein adsorbiertes Molekül an einer Wand, sowie der Fall zweier Mole- 
küle, die durch Zusammenstoss im Gasraum miteinander in Berührung 
getreten sind. In solchen Systemen ist eine Bindung wesentlich 
schwächer als die anderen und zwar von der Grössenordnung der 
Kohäsionskräfte, das ist etwa zehnmal kleiner als die interatomaren 
Bindungen. 

Es ist klar, dass das Vorhandensein einer schwachen Bindung 
zwischen zwei schwingenden mehratomigen Gebilden einen geringeren 
Energiefluss ermöglicht, als wenn die gleiche Stelle stark gebunden 
wäre. Ist keine Kraft vorhanden, verschwindet ja der Energiefluss. 


3. Monomolekulare Reaktionen in Gasen und Lösungen. 


Wir wollen hier die Umwandlung bzw. den Zerfall von Molekülen 
ins Auge fassen, die gross genug sind, um mit hinreichender Genauig- 
keit der Gleichung (4a) zu entsprechen. Das bedeutet, dass die Zufuhr 
von Energie an das Molekül als geschwindigkeitsbestimmender Faktor 
zurücktritt und es lediglich auf die inneren Vorgänge ankommt. Dies 

ee 


1) Für den 2-dimensionalen Fall gilt entsprechend Sam e 
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ist dann der Fall, wenn das Molekül gross genug ist, um stets die 
Energie £ in sich zu tragen und auch dann, wenn es von einer so grossen 
Zahl äusserer Impulse getroffen wird, dass hierdurch die nötige Energie- 
zufuhr gewährleistet ist. 

Die Geschwindigkeitskonstante x solcher monomolekular verlau- 
fender Reaktionen soll gemäss (4a) 


ec. A (7) 
sein, worin Q die Aktivierungswärme bedeutet. 


Für dreidimensionale Umwandlungen geht dies, wie oben aus- 
geführt wurde, über in 


Der ‚temperaturunabhängige Faktor“ der monomolekularen 
Reaktionen ist also gleich der Atomfrequenz, bei dreidimensionalen 


Reaktionen noch mit Lip multipliziert. Die Grössenordnung der Atom- 
frequenz ist 5:1012—5-1013 sec=!, somit wäre 


Q Q 


x= 510%e RT bis 510!e AT. 

Welche Zahl aber bis zum 20fachen Wert ansteigen kann, wenn die 
Reaktion durch Verschieben in mehreren Richtungen erfolgen kann. 

Dieses Ergebnis steht in Übereinstimmung mit der Erfahrung, 
dass der temperaturunabhängige Faktor der Reaktionsgeschwindig- 
keitskonstante monomolekularer Reaktionen, stets nahezu den glei- 
chen in der Grössenordnung 1013 bis 1014 gelegenen Wert aufweist. 
Diese Regel kommt bereits in der von S. DUSHMANN!) aufgestellten 
Gleichung 


hive 


z — pře KET 


zum Ausdruck, da die Abweichungen zwischen »* = —_ und unserem 


~ Nh 


2 l : 
v (baw. ~~“) innerhalb der Streuung der Werte gelegen sind. 


20 Gd) 

| RT 
Einen Überblick über die temperaturunabhängigen Faktoren mono- 

molekularer Reaktionen gibt die Fig. 2. In die Figur sind aufgenommen 


die in Lösung verlaufenden Reaktionen, die von J. A. CHRISTIANSEN?) 


1) S. Dusumann, J. Amer. Chem. Soc, 43, 397. 1921. 2) J. A. CHRISTIANSEN, 
Z. physikal. Chem. 104, 451. 1923. 
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zusammengestellt worden sind, sowie die Werte für folgende Gas- 
reaktionen: 2N,0,= 2N,0,-+ O,1); Zersetzung von CH,COCH,?); 
CH,N,CH; = N; + C,H,*); Zersetzung von C,;H,N,Cl,H,*); CH,0CH, 
= CH, + H, + C05); Zersetzung von C,H,OC,H;°); Zersetzung von 
CH,CH,CHO’); Racemisierung von Pinen?). 

Die Streuung kann zum Teil durch die Unsicherheit der Messung 
(namentlich der Aktivierungswärme) verursacht sein, doch lässt die 
Näherung, in der wir unsere Betrachtung durchgeführt haben, auch 
die Möglichkeit echter Abweichungen zu. 

Bei den ausnahmsweise stark nach oben herausfallenden Werten 
ist die Möglichkeit zu berücksichtigen, dass die Faktoren durch Ketten- 


Anzahl der Reokhonen in den er- 
zelnen Intervallen 


0 17 5 70 2 715 
Größenoronung der A-Konstante der Gleichung #=Ae RT in Zehnerpofenzen 


Fig. 2. 


reaktionen vergrössert werden können. Für Abweichungen nach unten 
könnte, falls sich diese als reell herausstellen, eine Erklärung durch 
Verfeinerung der Theorie gefunden werden. 

Bevor wir uns weiteren Anwendungen der Gleichung (7) zuwenden, 
sei auch noch der Reaktionsverlauf in Gasen bei niedrigen Drucken 
kurz betrachtet. Im Grenzfalle kleiner Drucke ist stets die Zufuhr- 
geschwindigkeit der Energie für die Reaktionsgeschwindigkeit mass- 


1) DANIELS tind JOHNSTON, -J. Amer. Chem. Soc. 43, 53. 1921; G. SPRENGER, 
Z. physikal. Chem. 136, 49. 1928. 2) HırscH und HINsSHELWooD, Proc. Royal 
Soc., London A .111,.245. 3) RAMSPERGER, J. Amer. Chem. Soc. 49, 912, 1495. 
1927. 4) RAMSPERGER, J. Amer. Chem. Soc. 50, 617. 1928. 5) HırscH und 
ASKEY, Proc. Royal Soc., London A 115, 215. 1927. 6) HırscH, Proc. Royal 
Soc., London A 114, 221. 1926. 7) Hırsch und Tompson, Proc. Royal Soc., 
London A 113, 221. 1926. 8) SMITH, J. Amer. Chem. Soc. 49, 43. 1927.. 


58 Part I. Physical Chemistry 


448 M. Polanyi und E. Wigner 


gebend. Wenn bei jedem Stoss voller Energieaustausch zwischen den 
zusammenstossenden Molekülen stattfinden würde, so wäre die Wahr- 
scheinlichkeit für die Übertragung einer Energie, die grösser als € L ist 


N 
BEER (a) ee ET 
(s — 1)! \kT | 


und damit die Reaktionsgeschwindigkeitskonstante x 


1 e ee —— 
le © ** 
worin S die Stosszahl bedeutet. 

Diese Gleichung ist nun mit Rücksicht darauf zu modifizieren, 
dass der Energieaustausch, wie in Abschnitt 2 erwähnt, durch die 
schwache Bindung, welche zwei aneinander stossende Moleküle ver- 
bindet, nur in beschränktem Masse vor sich geht, was zu einer Ver- 
minderung von x führt. 


4. Verdampfungsgeschwindigkeit. 


Eine weitere Anwendung können unsere Vorstellungen bei der 
Verdampfungsgeschwindigkeit finden. Wir wollen diesen Fall jedoch 
nur als Beispiel für unsere Vorstellungen behandeln, da die Ge- 
schwindigkeit des Prozesses selbst schon durch frühere Betrachtungen?) 


1) M. Poranxı, Z. Physik 1, 337. 1920. Mit Hilfe dieser Gleichung hat 
E. WIGNER (Diss., Technische Hochschule, Berlin 1925) gezeigt, dass die Geschwin- 
digkeit der bis dahin bekannten monomolekularen Gasreaktionen durch Zufuhr von 
Energie auf dem Wege der Zusammenstösse erklärt werden kann. Dasselbe unter- 
suchen G. N. Lewis und D. F. SMITH, J. Amer. Chem. Soc. 47, 1514. 1925. Von 
J. A. CHRISTIANSEN, Proc. Camb. Phil. Soc. 23, 438. 1926 und HinsHELWwoop, Proc. 
Royal Soc., London 113, 230. 1926 wurde diese Frage weiter bearbeitet und gefördert. 
Den Übergang vom Grenzgebiet kleiner Drucke zu dem monomolekularen Verlauf, 
der bei hohen Drucken eintritt, haben RICE und RAMSPERGER (J. Amer. Chem. Soc. 
49, 1617. 1927), sowie auch I.. S. KASSEL (J. physic. Chem. 32, 225. 1928) neu be- 
trachtet, indem sie die Notwendigkeit des Übergangs der Aktivierungswärme auf 
einen bestimmten Freiheitsgrad zugrunde legten. An dem Prgebnip ihrer Arbeit 
wird durch unsere Ausführungen nichts geändert. 

2) B. BAULE, Ann. Phys. 44, 145. 1914; J. LANGMUIR, Physical Review 8, 
149, 1916. Siehe die zusammenfassende Darstellung bei K. F. HERZFELD, l. c, 
S.229ff. (Die von HERZFELD an anderer Stelle des Buches gegebene direkte 
Berechnung der Verdampfungsgeschwindigkeit ist wegen stillschweigenden An- 
nahmen, die sie enthält, nicht als strenge Ableitung zu betrachten.) 
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mit Hilfe des Gleichgewichts und der Riickreaktion berechnet worden 
ist. Unsere Behandlung ist nur insoweit befriedigender, als sie die 
Geschwindigkeit der Verdampfung auch direkt abzulesen gestattet. 
Da das Wegfliegen eines Atoms vom Kristall nach allen drei 
Raumrichtungen erfolgen kann, handelt es sich hier um eine ,,drei- 
dimensionale Umwandlung“. Die Wahrscheinlichkeit œ dafür, dass 
ein bestimmtes Atom in der nächsten Sekunde verdampft, ergibt sich 
somit gemäss (7a) 
A TET 
p = 2y RTE ; (9) 
worin 4 die Verdampfungswärme bedeutet. 
Eine Bestätigung dieser Gleichung finden wir darin, dass sie in 
Verbindung mit der Dampfdruckgleichung für hohe Temperaturen!) 
(c Konzentration des gesättigten Dampfes) 


zur Erfahrungstatsache führt, dass ein Atom (Molekül), das auf sein 
Kondensat trifft, dort haften bleibt: Bezeichnen wir nämlich die Wahr- 
scheinlichkeit hierfür mit w, so ergibt sich (d = Gitterkonstante) 
VE Yonah te 
27 m ERT 
woraus sich w zu 


W = — ---- Zu 1 (10) 


berechnet wie man durch Einsetzen der Werte von A, m, v findet 

Bei tieferen Temperaturen wird unser Bild etwas komplizierter, 
es zeigt sich nämlich, dass, wenn kT < hv wird, der Akkomodations- 
koeffizient ausgesprochen abfallen muss. 

Auch durch direkte Anwendung unserer Grundvorstellungen 
kommt man zum gleichen Ergebnis. Wenn ein Molekül nämlich auf 
sein Kondensat trifft, so muss es dort — um wieder wegfliegen zu 
können — wenigstens eine halbe Schwingung ausführen. Während 
dieser Zeit wird es aber schon mit erheblicher Wahrscheinlichkeit so 
viel Energie (~ kT) verlieren, dass es haften bleibt. Es ergibt sich 


1) Siehe z. B. K. F. HERZFELD, Kinetische Theorie der Wärme. S. 223. 1925. 
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also, die Wahrscheinlichkeit w für die Kondensation auftreffender 
Moleküle zu w1?). 

Der Vollständigkeit halber wollen wir noch bemerken, dass diese 
letztere Überlegung nur dann gilt, wenn das Atom eine empfindliche 
Kugel seines Kondensats trifft, d.h. auf eine solche Stelle fällt, wo 
die potentielle Energie kleiner als die Verdampfungswärme ist. Dass 
diese empfindlichen Kugeln die Oberfläche eines Kristalls praktisch 
vollkommen bedecken, kommt in (10) zum Ausdruck. 


5. Wandreaktion. 


Wir haben oben die Geschwindigkeit monomolekularer Reak- 
tionen im Gasraume und in Lösung betrachtet. Es ist anzunehmen, 
dass ein Molekül, das an einer Wand reagiert, in gleicher Weise zu 
betrachten ist, da auch hier die Bedingung ausreichender und daher 
die Reaktionsgeschwindigkeit nicht beschränkender Energiezufuhr 
vorliegt. Eine Abschätzung der Geschwindigkeit von Wandreaktionen 
ergibt sich auf dieser Grundlage wie folgt: 

Die Anzahl N, der an der Oberfläche » adsorbierten Moleküle 
bei der Konzentration c im Gasraume ergibt sich aus der Adsorp- 
tionswärme W, wenn wir Dicke d der Wirkungssphäre der Adsorp- 
tionskräfte zu 10-8 cm annehmen: 


DA LA 
N, = wcde ET —=w.10”-8ceHT. 
Der Anteil der pro Sekunde umgewandelten Moleküle ist gemäss (7) 
1dN __ -5 


Ope RE 
N, dt a 


wo Q die Aktivierungswärme im adsorbierten Zustande bedeutet. 


1) Vgl. auch die zitierte Arbeit von B. BAULE. Man könnte daran denken, 
den Kondensationsvorgang vom Standpunkt der gewöhnlichen Wärmeleitung zu 
behandeln, indem man versucht, den Wärmefluss zu berechnen, der während des 
Aufstossens und Rückprallens stattfindet. Dieser Weg ist jedoch grundsätzlich zu 
verwerfen, da die Wärmeleitungsgleichungen nur in Bereichen gültig sind, die gross 
sind gegenüber der DesByYEschen ,,mittleren Weglänge“ der Wärmeleitung. (Es 
kommt in dem hier behandelten Falle noch hinzu, dass der Temperaturgradient 
extrem gross ist, so dass auch in dieser Hinsicht die Voraussetzungen der Wärme- 
leitungsgleichungen nicht erfüllt sind.) In der Tat erhält man nach dieser Rechnungs- 
weise für 1 — w eine sehr kleine Zahl, was der Erfahrung nicht entspricht, da 1 — w 
(wie w selbst) in der Grössenordnung von Eins gelegen ist (vgl. P. HARTECK, Z. 
physikal. Chem. 134, 1. 1928). 
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Hieraus ist 
Ah _ Q = l W 
a cwdve ET a wcle Fel. 


Die Anzahl N* der auf die Fläche W treffenden. Moleküle ist 


V RT 
* — md 4 
N w sa ° 104 we 


(T =600, Molekulargewicht = 60), demnach 


dc u 
of 100 N* RT. 
dt 09 i 


Der temperaturunabhängige Faktor ergibt sich demnach im all- 
gemeinen nicht gleich der Anzahl der auf die Wand treffenden Mole- 
küle, sondern wird meist erheblich grösser sein, als diese. 

Die Messungen von SCHWAB und PIETScH!) über die Zersetzung 
von Methan ergeben für den temperaturunabhängigen Faktor den 
Wert 1045-N*. Auch die Messungen der Oxydation von Wolfram 
durch Sauerstoff, die von LANGMUIR?) ausgeführt wurden, sowie noch 
unveröffentlichte Messungen von L. Marton und M. PoLAnyı?) über 
die Einwirkung von H,O auf Wolfram zeigen diesen Effekt; die tempe- 
raturunabhängigen Faktoren betragen hier 10:5- N* bzw. 10°-N*. Es 
scheint uns somit, dass diese Erscheinung allgemein verbreitet ist. 
Neben der grundsätzlichen Erklärung, die wir oben angegeben haben 
(die natürlich noch weitgehend verfeinert werden müsste) ist noch 
zu berücksichtigen, dass (wie schon ScHwAB und PIETSCH erwähnen) 
die Oberfläche durch Zerklüftung vergrössert sein kann, wodurch (bei 
grossen freien Weglängen) die adsorbierte Menge wächst, ohne dass 
die berechnete Anzahl der Wandstösse sich ändert. 


Schlussbemerkung. 


Innerhalb eines durch feste Bindungen zusammengehaltenen 
Systems von Atomen führt die Interferenz der Eigenschwingungen zu 
Schwankungen der Amplitude der einzelnen Atome, die folgender Art 
sind: Die Energie eines Atoms ändert sich jeweils im Mittel während 


einer Zeit — in gleichem Sinne. Während dieser Zeit beträgt die Ande- 


1) ScHwAB und PIETSCH, Z. Elektrochem. 32, 430. 1926. 2) J. LANGMUIR, 
J. Amer. Chem. Soc. 35, 105. 1913. 3) L. Marton und M. Poranyi, Aus dem 
Forschungslaboratorium der Vereinigten Glühlampenfabriken Ujpest. 


62 Part I. Physical Chemistry 


452 M. Polanyi und E. Wigner, Über die Interferenz von Eigenschwingungen usw. 


rung der Energie im Mittel etwa kT. Die Häufigkeit, mit der die 


Energie auf den Betrag e (e > xT) anwächst, ist ~ve kT T sec™t, Die 
Atomfrequenz hat also die Bedeutung einer Wechselzahl der Schwin- 
gungsenergie. 

Wenn eine chemische Reaktion oder sonstige Umwandlung durch 
Überschreitung einer kritischen Elongation eines bestimmten Atoms 
erreicht wird, indem ein Energieberg überschritten wird (Umwand- 
lung), oder das Atom auf ein höheres Energieplateau hinaufgeschafft 
wird (Dissoziation, Verdampfung), so berechnet sich demnach die 


Häufigkeit eines solchen Vorganges zu ve KT, 

Das Ergebnis liefert die Erklärung der alten, Erfahrung, dass der 
temperaturunabhängige Faktor der Geschwindigkeitskonstante mono- 
molekularer Reaktionen in der Grössenordnung von 10!* gelegen ist. 
Auch die Verdampfungsgeschwindigkeit kommt im Einklang mit der 
Erfahrung heraus. Es erklärt sich ferner, wieso der temperaturunab- 
hängige Faktor bei Wandreaktionen vielfach grösser herauskommt, 
als die Anzahl der gegen die Wand stossenden Moleküle. 

Wenn das System eine schwache Bindung enthält (Moleküle im 
Zusammenstoss, Moleküle an der Wand), so ist an dieser Stelle die 
Energieübertragung schwächer, als über eine feste Bindung. 

Unsere Behandlung des Gegenstandes war nicht darauf gerichtet, 
dass die Geschwindigkeitsgleichungen streng richtige Werte für das 
Gleichgewicht liefern. Das Ziel war, für praktisch wichtige Fälle die 
Geschwindigkeitskonstante abzuschätzen. 


The Statistics of Composite Systems According 
to the New Quantum Mechanics 


E.P. Wigner 


(Original in Hungarian with German abstract). Összetett Rendszerek Statisztikdja 

az Uj Quantum-Mechanika Szerint. Magyar Tudományos Akadémia Matematikai 

és Természettudományi Értesitöje 45, 576-582 (1929). Translated from Hungarian 
by Akos Sebestyén 


1. Only one kind of statistics applies in classical mechanics. In fact the behavior 
of a system is completely (i. e., also microscopically) determined if the equations 
of motion are known. The latter does not influence the macroscopic behavior of 
the system since according to the quasi-ergodic hypothesis, it will get arbitrarily 
close to any state of the given energy after a long enough time, and statistical 
mechanics deals with averages when the system goes through all such states 
many times. We obtain the probability of the macroscopic state of the system 
according to this classical, or, as it is called, Boltzmann statistics if we calculate 
the number of ways its various parts can be placed in the phase-space cells 
compatible with this state. In counting these distributions we must distinguish 
two states in which the same number of atoms are placed in the same cells but 
they differ from each other through an interchange of atoms. In combinatorics 
this rule is expressed by considering the atoms as distinguishable, and we must 
calculate the number of possible distributions of the atoms in the cells. 

All these can be derived from classical mechanics and the quasi-ergodic 
hypothesis recalled before, in the simplest way with the help of Liouville’s 
theorem. (See e.g. “Introduction to the Corpuscular Theory of Matter” by 
R. Ortvay, (1927) Budapest.) 

If we consider the combinatorial statement in classical statistics: “The 
atoms have to be distributed in the cells but if atoms in two different cells in 
one distribution are interchanged compared to another, then these two distri- 
butions must be considered different even if atoms of identical chemical nature 
are interchanged (for instance in case of two atoms of hydrogen)” , then it seems 
to be obvious to relinquish the latter condition and to consider atoms of the 
same kind indistinguishable. Bose, who originated this idea has shown that if 
we apply this statistics to a gas consisting of light quanta we obtain directly 
Planck’s radiation law. 

As an example we shall apply these statistics for two atoms and two cells, 
an even more simple case. We denote the two cells by two squares and the two 
atoms in Boltzmann’s case by the numbers 1. and 2., and in Bose’s case by two 
points. 


* Editor’s Note: There are two slips at the end of the paper and in the abstract. 
Contrary to what is stated there, nitrogen (!?N) nuclei satisfy Bose statistics as, 
more generally, do all nuclei that contain an even number of particles. 
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Boltzmann Bose 


We see, that in Boltzmann’s case the probability that the two atoms are 
in different cells is equal to the probability that they are in the same cell. 
According to Bose the latter is twice as large as the former. This is very suitable 
to characterize Bose’s statistics qualitatively. 

Shortly after Bose’s investigations, Einstein applied Bose statistics to an 
ideal gas. He found interesting phenomena of degeneracy which, following him, 
have also been described by many others and which we can understand if we 
suppose that the probability for two atoms to reside in the same cell of phase- 
space is larger than what is claimed by classical statistics. 

Soon after Einstein’s paper, Fermi stressed that, in contrast, our experience 
concerning the electron structure of atoms does not suggest a larger probabil- 
ity for placing two atoms in the same cell than in classical statistics. On the 
contrary, Pauli’s principle establishes (on the basis of experience gained from 
spectra) that no state can exist in which a cell of phase-space contains more 
than one particle. Fermi took 0 for the probability of such states while the apri- 
ori probabilities of those states are equal where there is at most one particle in 
the cell. This is called Fermi statistics. 


2. As we see, setting up other statistics than Boltzmann’s (the Bose and 
Fermi types) was completely arbitrary. Quantum mechanics however estab- 
lished to a certain extent the motivation for these two new statistics. 

We have noted before that in addition to mechanics the quasi-ergodic hy- 
pothesis is also needed to formulate Boltzmann statistics. 

Heisenberg [1] and Dirac [2] have independently noticed that this hypothesis 
does not apply in quantum mechanics. They have shown that for instance in 
the case of two electrons there exist two groups of states, the symmetric and 
antisymmetric states which do not approach each other. The symmetric states 
are characterized by wave functions 


W(r1,72,+++5Tn) 
(where 71,72,...,T are the coordinates of the particles) symmetric in the ar- 
guments; that is 
WPT cna) =O Tess Rasa) (1) 
holds for any permutation a1,a@2,...,Q@, of the numbers 1,2,...,n. For 


wave functions of antisymmetric states (1) holds only for even permutations 
Q1,Q02,..-,Qn, otherwise 


Wis Treat a) SU as Pane es ae) (la) 


Now a symmetric state always remains symmetric at all times and similarly 
an antisymmetric state remains always antisymmetric. This follows from the 


Statistics of Composite Systems According to the New Quantum Mechanics 65 


time development of Y which is described by a linear differential equation being 
symmetric in 


Pg 10 08 Tn. 


There exist other states as well, besides the symmetric and antisymmetric 
states (in the case of two particles only these two exist; but for three particles 
there are three, and for four particles there are five kinds of conceivable states). 
However, these two cases are the simplest and the most important, and we need 
not consider the others, since Dirac has shown that Bose statistics applies for 
particles in symmetric states and Fermi statistics has to be used for those in 
antisymmetric states. Accordingly, quanta of light are always in symmetric 
states and electrons are always in antisymmetric states. The latter applies also 
for protons. 

In what follows, we shall see that for composite bodies the first or the 
second statistics applies depending whether it consists of an even or an odd 
number of particles. To determine this we have to treat nuclei as being built 
up of protons and electrons. 


3. Consider first two hydrogen atoms in their normal states confined to a 
box. Let the coordinates of the nuclei be R; and Rz and those of the electrons 
rı and r2; we denote by L the wave function of the normal state of the H-atom 
(the first Laguerre-function) and the possible stationary wave functions of the 
box, that is the de Broglie’s functions, by Bg. Let M and m be the masses of 
the nucleus and the electron respectively. The following four states have the 
same energy: 


pı = By (=e) L(Ry - 11) By (ae) L(Ro - 2), 
who = Br Fr) L(Rı — 12) Be Fr) L(Rz = 11), 
m) arm) Ron) 


Out ofthese functions, we can construct only one function that is antisymmetric 
in both of the pairs Rı, Ra and rı,r2. The states realizable in nature correspond 
to such functions as 


Der = V1 — 03 — (2 — Wa) = Ver - (3) 


As we see a single state corresponds to any combination kk’ irrespective of its 
order; there is also a state for k = k’; we get exactly the same number of states 
as required by Bose’s statistics. If we write 
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MR+mr = 


M-+m (4) 


corresponding to the large nuclear mass compared to the the electron we get 


Wk! = [Br(Rı)Br (R2) + By, (Re) Br (Rı)] L(Rı = rı)L(Ra = r2) 
— |Bk(Rı)Br (Ra) + Br(Ra2)Br (Rı)) L(Ra — rı)L(Rı = T2) ; (5) 


The expression in square brackets is characteristic for Bose statistics. We have 
just seen that the number of the stationary states of a gas consisting of two H- 
atoms in their normal states is given by Bose statistics; (5) shows that also the 
statistics of their position corresponds to Bose statistics, at least if we consider 
domains large compared with the radius of the H-atom. In this case Ypg’ differs 
considerably from zero only if either Ry — rı and R2 — r2 or Ry — rz and R2 -rı 
is very small. In both cases only one of the two terms is dominant and the 
factor 


[Bk(R1)Be (R2) + Br(R2)Be(Rı)] 


gives Bose statistics for the position, (e. g. the maximal probability of the state 
Rı ~ Rə.) 


4. If the number of atoms is more than two, let us say n, we have two write 
the only antisymmetric combination in place of (5) 


Pki vet kn = ee >, [LR = T(r) )Er 
o x TA 


L(R =f] er Rr- ri 
= L Beo a) Glee ee) ee 


where o and r denote the permutations 1(c), 2(o),...,n(o) and 1(r), 2(r), 
... n (T) respectively of the numbers 1,2,...,n. The summarizations over o 
and r must be extended for all the n! permutations, €+ = +1 or €+ = —l1 
depending whether 7 is an even or an odd permutation. Now we can derive the 
requirements of Bose statistics from (5a) similarly to the case (5), rigorously for 
stationary states, but also for the statistics of the position provided we consider 
domains large compared to the radius of the atom. 

For the case of more complicated atoms or molecules we must proceed 
similarly and we obtain within the same limitations Bose or Fermi statistics 
depending whether the body is built up from an even or odd number of particles. 

In the preceding considerations we did not take into account either the 
magnetic moments of the protons and of the electrons or the possibility that 
the normal state may be degenerate in so far as it has an “internal” quantum 
number that is greater than 0. Our results are exactly valid for the states is 
only. In other cases we have to remedy this by counting all the cells of the phase 
space as many times as the degree of degeneracy of the state considered, and 
then apply Bose’s or Fermi’s statistics. We must proceed in this way even in 
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the case of electrons since their lowest states are doubly degenerate on account 
of their magnetic moments. (See e. g. W. Pauli jr.: Zs. f. Phys. 41, 81, 1927.) 


5. In the paper just mentioned W.Pauli jr. studies the behavior of the 
electron gas on the basis of Fermi’s statistics. A. Sommerfeld [3] gave with 
great success further applications of this statistics. 

This application is not so easy for compound bodies. In gases with temper- 
atures so low and densities so high that a considerable deviation from Boltz- 
mann’s statistics can be expected the atoms are nearer to each other than 
allowed by the restrictions of the preceding sections which made our results 
applicable, or gas condensation may take place. 

The statistics of atomic nuclei can be established far more certainly than 
the statistics of entire atoms. Heisenberg first called attention to the absence 
of even or odd lines in the band spectra of molecules which can be traced back 
to nuclei that can be in only antisymmetric or else in only symmetric states. It 
can be shown generally (E. Wigner and E. E. Witmer: Zs. f. Phys. 51, 859, 1928, 
loc. cit. in what follows) that the absence of lines can be expected only for such 
molecules consisting of identical atoms (H2, Hez, N2, O2 etc.) which have no 
“electric moment”. There are four kinds of such states which we denote by O+, 
O’, and O_, O/.. In loc. cit. it is also established that in case of O, and O/, the 
even lines are symmetric while, on the contrary in case of O_ and O the even 
lines are antisymmetric. If only symmetric states exist in the nuclei and they 
have no angular momentum then the antisymmetric states are absent and vice 
versa, if they have angular momenta the antisymmetric states yield far weaker 
lines. 

Experiments show for the four molecules mentioned before that the odd 
lines are always strong, the even ones are completely absent for He, and Oz. 
Consequently, the nuclei of O and He have no angular momentum at all while 
the angular momentum of H and N are 3 fe and fe respectively (see R. de 
L. Kronig: Die Naturwissenschaften 16, 225, 1928.) In order to decide whether 
the nuclei are in symmetric or antisymmetric states we must know whether the 
spin angular momentum of the electron is O_ or Ox. It is not quite elementary 
to determine this. However, the basis of loc. cit., starting from certain criteria 
(especially on the basis of band convergence) it is nevertheless possible. Based 
on this it can be concluded that the normal state of H, and N; is 7O,, for O2 
it is ?O_, and for Hex it is probably !O_. 

This leads us to the conclusion that the following rule is also correct for 
nuclei: bodies built up of an even number of particles follow Fermi’s statistics. 


References 
[1] Zs.f. Phys. 38, 411 (1926) 
[2] Proc. Roy. Soc. 112, 661 (1926) 
[3] Die Naturwissenschaften 16, 372 (1928), papers in Zs. f. Phys. in 1928 
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STATISTIK ZUSAMMENGESETZTER SYSTEME 
NACH DER NEUEREN QUANTEN-MECHANIK. 


Von E. WIGNER. 


Es wird die Frage untersucht, durch welche Statistik (BoLrz- 
MANN, Fegmı oder Bose) man die Entartungserscheinungen, sta- 
tistischen Gewichte usw. von zusammengesetzten Systemen 
(Atomen oder Molekülen) beschreiben kann. Natürlich wird diese 
Beschreibung nur solange gelten, als man die Systeme als 
Massenpunkte ansehen darf. 

Zur Entscheidung der Frage wird die Wellenfunkzion eines 
Gases, bestehend aus den betreffenden Atomen, bezüglicherweise 
Molekülen, aufgeschrieben (@1(5) für ein Gas aus zwei Wasser- 
stoffatomen, Gl(5a) für den allgemeinen Fall. Die grossen R 
sind die Schwerpunktskoordinaten, die kleinen r die inneren 
Koordinaten des Atoms). Aus dieser Wellenfunktion wir dab- 
gelesen, dass das ganze System Ferrmti-scher oder Bosz-scher 
Statistik genügt, je nachdem die Anzahl der das System auf- 
bauenden Fermi-schen Elementarteilehen ungerade oder gerade ist. 

Fiir Atomkerne ist die Bedingung, dass man sie als Massen- 
punkte ansehen kann, bei optischen Versuchen allenfalls erfüllt. 
Die Bandenspektren zeigen in der Tat eine Bestätigung der 
Theorie. Das Spektrum von H, (Grundzustand !F,) zeigt, dass der 
FH-Kern Frrui-scher, das Spektrum von N} Grundzustand °F), 
dass der A-Kern ebenfalls Feruı-scher, das Spektrum von O, 
(Grundzustand ®$_), dass der O-Kern Boss-scher Statistik genügt. 


(Aus der Sitzung der III. Klasse der Ungarischen Akademie der Wissen- 
schaften vom §. Okt. 1928.) 


Über die Geschwindigkeitskonstante 
von Austauschreaktionen 


H.Pelzer und E. P. Wigner 


Zeitschrift für physikalische Chemie B 15, 445-471 (1932) 


(Eingegangen am 19. 11. 3].) 


Es wird zunächst versucht, die Annahme von Lonpow zu begründen, dass 
die chemischen Austauschreaktionen adiabatisch verlaufen. Die wesentliche Be- 
dingung hierfür scheint es zu sein, dass die unterste Energiefläche energetisch weit 
getrennt von den übrigen liegen soll. Ferner wird für die Umsetzung von Para- 
wasserstoff in normalem Wasserstoff, für die EYRING und PoLaxYı die Aktivierungs- 
wärme berechnet haben, die absolute Grösse der Geschwindigkeitskonstante be- 
stimmt. 


1. Wir kennen drei verschiedene Typen von chemischen Reak- 
tionen: 
I. Elementare Addition und spontaner Zerfall (Prädissoziation): 
H+H= d, D 
AIH S Al + H. 


Dass solche Reaktionen wegen des gleichzeitigen Bestehens von 
Energie- und Impulssatz nicht ohne Schwierigkeiten vor sich gehen 
können, haben zuerst M. Born und J. Franck bemerkt!). 

Die Reaktion (I) kann zweierlei Ursachen haben. Im ersteren 
Falle?) kann man von einem Molekül ausgehen, das keine Elektronen- 
anregung, nur Rotationsenergie und Schwingungsenergie hat. Man 
kann nämlich bei einem rotierenden zweiatomigen Molekül die Zentri- 


fugalkräfte durch ein ,,Zentrifugalpotential‘ 1 (M =Drehimpuls, 


m =relative Masse der beiden Atome, r = ihr Abstand) ersetzen. Wenn 
dieses Potential zusammen mit dem Anziehungspotential der beiden 
Atome bei grösseren Kernabständen ein Maximum hat, so sind klassisch 
solche Bewegungen des Moleküls möglich, deren Energie kleiner als 
dieses Maximum, aber doch grösser als die Dissoziationsenergie ist, 
und die trotz dieses letzten Umstands stabil sind. Nach der Quanten- 
mechanik sind aber auch diese Bewegungszustände nicht vollkommen 
stabil, sie zerfallen infolge des ,Tunneleffekts“, weil ein Potential- 


1) M. Born und J. Franck, Z. Physik 31, 411. 1925. 2) J. OLDENBERG, 
Z. Physik 56, 563. 1929. 
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berg in der Quantenmechanik, selbst wenn die Energie zu seiner 
Uberschreitung nicht hinreicht, kein absolutes Hindernis ist und 
,unmechanisch“ übersprungen werden kann. Das einfachste Beispiel 
hierfür bildet der radioaktive Zerfall?). 

Die zweite Reaktionsart nach dem Schema (I) ist die Prädissozia- 
tion?). Es handelt sich hierbei um den Zerfall eines Moleküls mit 
Elektronenanregung, oder in der Rückreaktion um die Bildung eines 
Moleküls mit Elektronenanregung aus den Atomen. 

Die absolute Grösse der Geschwindigkeitskonstante der Reak- 
tionen nach dem Schema (I) hängt eng mit der Linienbreite zusammen 
und lässt sich mit ihrer Hilfe absolut angeben °). 

II. Den zweiten Reaktionstyp bilden die sogenannten doppelten 


Umsetzungen: Na + CIH 2 NaCl + H 
H + 0l, Z HCl+Cl. | 


Wenn sich ein Na-Atom einem HCl-Molekiil nähert, wird es von 
diesem zunächst abgestossen. Hat es aber eine genügende kinetische 
Energie, um diese Abstossung zu überwinden, so lockert es gleich- 
zeitig die CI-H-Bindung. Bei einem gewissen kritischen Abstand hört 
dann die Abstossung zwischen dem Na und Cl auf, während die bis- 
herige Anziehung zwischen dem Cl und H in Abstossung übergeht. 
Daher wird das H abgestossen, das Na und Cl werden dagegen ein 
neues Molekül bilden. Wesentlich ist für die Diskussion dieses Reak- 
tionsverlaufs, dass die potentielle Energie nicht die Gestalt 


rı)+g (713) + A (r23) 
hat, vielmehr die Kräfte zwischen dem H und Cl durch die Anwesen- 
heit des Na auch ihrerseits bzeinflusst werden. 

. Der vorbsschriebane Mechanismus wurde von F. Lonpon®) ent- 
deckt und von Eyrrne und POoLANYT?) genauer besprochen und zur 
Berechnung von Aktivierungswärmen verwandt. Charakteristisch für 
den ganzen Mechanismus ist, dass er adiabatisch, d.h. ohne Elek- 
tronensprung verläuft. Die vorliegende Note wird sich ausschliesslich 
mit dieser Reaktionsart beschäftigen und wird erstens Bedingungen 


(I) 


1) G. Gamow, Z. Physik 51, 204. 1928. E.U.Coxpon und R. W. GURNEY, 
Nature 122, 439. 1928. 2) R. DE L. Kronie, Z. Physik 62, 300. 1930. Siehe da- 


selbst auch weitere Literatur. 3) M. PoLanyı und E. Wiensr, Z. Physik 33, 
429. 1925. O. K. Ric, Physic. Rev. 33, 748. 1929. 35, 1538. 1930. 4) F. Lox- 
DON, SOMMERFELD - Festschrift, S. 104. S. Hirzel, Leipzig 1928. 5) H. EYRING 


und M.Pouanyi, Z. physikal. Ch. (B) 12, 279. 1931. 
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anzugeben versuchen, unter welchen man annehmen kann, dass die 
Reaktion adiabatisch verlauft. Zweitens soll versucht werden fiir 
einen Fall, in dem diese Bedingungen erfüllt sind, die absolute Grösse 
der Geschwindigkeitskonstante zu bestimmen. 

In die Gruppe (II) können wir auch die sogenannten Energie- 


und die Ladungsübertragungen ?) 
Hg + Na Z Hg+ Na (IIb) 
einreihen, die allerdings keineswegs adiabatisch verlaufen. 
Auch die Assoziation durch Dreierstoss 


Na + Na + He= Na, + He | Ic) 
H+01+ He 2 HCl+ He | 


‚gehört hierher, über die wir allerdings theoretisch sehr wenig wissen. 

Praktisch übarwiegt sie gewöhnlich an Geschwindigkeit die Reak- 
tion (I), die dasselbe Resultat hätte. 

Auch diese Reaktion kann adiabatisch verlaufen. Betrachten wir 
z.B. die Reaktion H + H + H 2H,- H, und nehmen wir als einfachsten 
Fall an, dass die drei Atome zunächst auf einer Geraden liegen, und 
dass auch die Geschwindigkeiten so beschaffen sind, dass die gerade 
Konfiguration erhalten bleibt. Dann kann man die Bewegung durch 
die Abstände c und 5 der seitlichen Atome vom mittleren Atom be- 
schreiben. Die zeitliche Änderung von c und b wird, wie in dem von 
EYRING und PoLanyı näher untersuchten Fall H,+ H durch die zeit- 
liche Änderung der Koordination c und b einer Kugel gegeben, die auf 
der in Fig. 1 gegebenen Energiefläche abrollt. Der Unterschied gegen- 
über dem von Eyrine und PoLaryı untersuchten Fall besteht nur 
darin, dass man hierbei nicht von einem der beiden Molekültäler (I) 
und (II), sondern vom Hochplateau ausgehen muss, das die Mitte der 
Figur einnimmt. Erteilt man der Kugel auf diesem Hochplateau etwa 
die in der Figur eingezeichnete Geschwindigkeit, so wird die Kugel 
längs der eingezeichneten Linie rollen und aus dem Molekültal (II) 
nicht mehr herauskommen, sondern darin unter Schwingungen in 
Richtung wachsender c fortschreiten. Dies entspricht der Bildung 


1) H. KALLMANN u, F. Lonpon, Z. physikal. Ch. (B) 2, 207. 1929. 2) H. KALL- 
MANN und B. Rosen, Z. Physik 61, 61. 1930. 
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eines Moleküls H,, das also auf diesem Wege adiabatisch und ohne 
Aktivierungswärme aus den Atomen entstehen kann. 


Ill. Der dritte Typ ist schliesslich die Dissoziation durch Inter- 
ferenz der Eigenschwingungen, etwa: 


N,0>N,+0. (III) 


Der Reaktionsmechanismus besteht darin!), dass die verschie- 
denen thermisch angeregten Eigenschwingungen, die die Bindungen 
bald dehnen, bald zusammendrücken, nach einer hinreichend langen 
Zeit an einer Bindung in Phase kommen und diese dann infolge ihres 
Zusammenwirkens sprengen. Über die Aktivierungswärme solcher 
Reaktionen können wir im voraus fast gar nichts aussagen. Wenn diese 
bekannt ist, kann man die absolute Grösse der Geschwindigkeitskon- 
stante roh abschätzen?). Interessant ist auch die Abhängigkeit der 
Reaktionsgeschwindigkeit von der Konzentration ?). Die Geschwindig- 
keit dieser Reaktion übersteigt die von (I) bei höheratomigen Mole- 
külen. Bei zweiatomigen Molekülen ist dagegen der Reaktionsmecha- 
nismus (III) ausgeschlossen, da bei diesen nur eine Eigenschwingung 
existiert und eine Interferenz von Eigenschwingungen daher nicht mög- 
lich ist. 

2. Wir wollen hier zuerst die Frage untersuchen, unter welchen 
Umständen eine Reaktion vom Typus (II) adiabatisch verlaufen wird. 
Die Vorstellung, die den nachstehenden Betrachtungen zugrunde liegt, 
ist dabei folgende: 


1) M. PoLtanyl und E. WIGNER, Z. physikal. Ch. (A) 139, 439. 1928. 
2) H. KUMMEROW und M. VOLMER (Z. physikal. Ch. (B) 9, 141. 1930) haben die 
Reaktion (III) quantitativ untersucht und finden eine 100- bis 1000mal kleinere 
Geschwindigkeitskonstante, als man nach der erwähnten Rechnung erwarten würde. 
Dies dürfte aber nicht so sehr auf der Ungenauigkeit der Rechnung beruhen, als 
vielmehr darauf, dass sich während der Reaktion (III) das Multiplettsystem des 
Gesamtsystems ändert: N, ist ein Molekül im Singulettzustand, der Grundzustand 
von O ist ein Triplettzustand. Daher ist das System, das durch die rechte Seite 
von (III) dargestellt ist, im Triplettzustand, während N,0 ein Singulettmolekil ist. 
Die Reaktion N0 > N+ O ist also eigentlich ‚verboten‘, und diese Tatsache 
kann für den Faktor von etwa 0°001 verantwortlich gemacht werden. Natürlich 
würde man ohnehin keine gute Übereinstimmung in diesem Falle zwischen Rech- 
nung und Experiment erwarten, da es sich um die Zersetzung eines sehr wenig- 
atomigen Moleküls handelt, während die Rechnung für hochatomige Moleküle an- 
gestellt worden ist. Doch dürfte dies keinen Faktor von der Grössenordnung 0°001 
bedingen. 3) Vgl. etwa die Diskussion bei O. K. Rick und H. RAMSPERGER, J. 
Am. chem. Soc. 46, 1617. 1927. L. S. KASSEL, J. physical Chem. 32, 225. 1928. 
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Wir können uns die SCHRODINGER-Gleichung für das ganze System 
aufgeschrieben denken 
h 0 (X, a, t) 
274 òt 


h? 2 vw h? d? Ww 
«8? M 2 Xi Sm bak” N 
Hierbei sollen die x, die Koordinaten der Elektronen, die X, die 
Koordinaten der Kerne bedeuten, die Massen der Kerne wurden der 
Einfachheit halber gleich angenommen. Lassen wir für einen Augen- 
blick den Operator der kinetischen Energie der Kerne weg, so können 
wir nach dem Vorgang von BoRN und OPPENHEIMER?) im übrig- 
gebliebenen Operator H, die Kernkoordinaten X als Parameter be- 
handeln und die Eigenwertgleichung 
y h? 0° x 7 
E.(X)y,(X, x) = A, y(x, x) = Y d Tr p (X, x) Wy (2) 


nm dx 


für jeden Wert der Parameter X, gelöst denken. Es werden natürlich 
sowohl die Eigenwerte E,(X), wie auch die Eigenfunktionen von den 
Werten der Parameter X, abhängen. Die Orthogonalitäts- und Norma- 
lisierungsbedingungen lauten ?) 

f(y (X, z dr =1;  fy-(X,xz)y(X,x)dx = 0 fir x+ 4 (3) 
oder nach X, differentiiert 


(3a) 


ò? Ww, B2 2 | 
foster a rt 


v 


wo beidesmal nur über die Elektronenkoordinaten zu integrieren ist. 
Die E,(X) bilden im Konfigurationsraum der Kerne allein Flächen, 
die wir als Energieflächen?) bezeichnen werden. Jede Energiefläche 
entspricht einem Zustand der Elektronen, der stationär wäre, wenn 
die Kerne wirklich an ihre Lagen festgenagelt wären. Jeder Punkt 
einer Energiefläche entspricht einer Konfiguration der Kerne. 


1) M. Born und I. R. OPPENHEIMER, Ann. Physik 84, 457. 1927. Vgl. auch 
C. ZENER, Physic. Rev. 88, 277. 1931. Unsere Überlegung beruht ebenso wie die 
von BORN und OPPENHEIMER darauf, dass die Kernmassen gross im Verhältnis zu 
den Elektronenmassen sind. 2) Die y, können und sollen als reell voraus- 
gesetzt werden. 3) Eine solche Energiefläche, nämlich die unterste Energie- 
fläche für den Fall dreier H-Atome ist in Fig. 1 gegeben. Sie wurde der Arbeit 
von POLANYI und EYRING entnommen. 
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Man sagt, dass die Bewegung des Systems adiabatisch etwa auf 
der ersten Energiefläche abläuft, wenn man so rechnen darf, wie wenn 
die Kerne unter dem ale eines Potentials Z,(X) stehen würden 


Ws 


ae Ae 
SS 


b Abstand der Atome Xund Zin A 


Fig. 1. Potentialgebirge dreier geradlinig angeordneter Wasserstoffatome. Die ein- 

gezeichneten Linien sind Niveaulinien. Nach Eyrina und Potanyt in schief- 

winkligen Koordinaten gezeichnet, um die Dynamik des ‘Systems richtig wieder- 

zugeben. Die mit einem Pfeil versehene Linie stellt eine mögliche Bewegung des 
Bildpunktes dar. 


und sich so langsam bewegen würden, dass die Elektroneneigenfunktion 
der Bewegung immer folgen kann, d.h. immer y,(X,x) bleibt. Um 
dies auszudrücken, zerlegen wir Y 
P(X, x,t) = aa tek. t) Y, (X, x). (4) 
Setzen wir dies in (1) ein, so können wir durch Multiplikation 
mit y, (X, x) und Integration über die Elektronenkoordinaten die Be- 
wegungsgleichung für die y, in en Weise ableiten 


"im a Oo Mamet ee 


7 h? ow, Ò Y Je 3? Ya | | 
PH IX, ee ax, 'J dX; 2 ORY. 


Man kann die Wellenfunktion Y durch die Wellenfunktionen 
Xv Xe Xz -- - ersetzen, die Bewegungen der Kerne allein entsprechen. 


(5) 
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Allerdings braucht man nicht nur eine Wellenfunktion dieser Art, son- 
dern unendlich viele. 

Wären in (5) die Glieder der zweiten Zeile, die tatsächlich klein 
sind, nicht vorhanden, so würde sich x, genau so verhalten, als ob 
es die Wellenfunktion von Massenpunkten wäre, zwischen denen das 
Potential Z,(X) herrscht. In Wirklichkeit sind aber die zu den ver- 
schiedenen Energieflächen gehörenden Wellen %1, % Xs; - - - nicht ganz 
unabhängig voneinander, weil sie sich wegen der Glieder der zweiten 
Zeile von (5) gegenseitig beeinflussen. 


Die Glieder der zweiten Zeile von (5) sind deswegen klein, weil 
in ihnen die Masse des Kerns im Nenner auftritt. In der Tat ist 


f m y„dx von der Grössenordnung eines reziproken Atomradius. Da- 
2 
k u /Atomradius (Wellen- 


- 


her hat die zweite Zeile die Grössenordnung 5 = 5 


länge der Kernwellen) und ist daher der Atomradius/(Wellenlänge der 
Kernwellen) kleiner als das Glied mit der kinetischen Energie der 
Kerne. Wir wollen dies aber noch 
genauer ausführen und zeigen, 
dass die Glieder der zweiten Zeile 
von (5) bei der Berechnung der 
Reaktionsgeschwindigkeit der 
Umsetzung H+ ,BZH,+H 
wirklich vernachlässigt werden 
können. 

3. Es sei die Energie des 
Systems E, und wir wollen die 
Voraussetzung machen, dass E 
kleiner ist als selbst der tiefste x 
Punkt Z, „ der zweiten Energie- Fig. 2. 
fläche (vgl. Fig. 2). Dies ist für 
die oben erwähnte Reaktion für die meisten Zusammenstösse, die zur 
Reaktion führen, erfüllt. Ihre Energie ist nämlich nur um weniges 
grösser als die Aktivierungswärme, während der tiefste Punkt der 
nächsten Energiefläche drei getrennten Atomen entspricht und daher 
wesentlich höher ist. Wenn wir die Glieder in (4), die den höheren 
Energieflächen entsprechen, zu einem Glied vereinigen 


W(X, x, t) = 7, (X, t) y, (X, x) + u (X, 2, t), (4a) 
so lautet H Y =E Y 


£ 


Ez (%9) 
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h? k ò y dyp, d7 0? “Hy 
>> 1 _Tı 1 1 
E (I)a — ga a axe tox. ox, Fasz) TA" 


(6) 
=Ey,y, + Eu 


und wir wollen zeigen, dass die Wahrscheinlichkeit der Anregung aller 
höheren Energieniveaus zusammen ff u? dzdX immerfort (für alle 
Zeiten) sehr klein ist. Wir multiplizieren (6) mit u* und integrieren 
über alle Koordinaten, wegen (3) verschwindet f u* y,dz schon über die 
Elektronenkoordinaten integriert. Ausserdem ist 


SEEN 
- MN s/f 2 xX , Saks oa 
> NM s/f sx ‘dxdX + Eom ff u? dedX, 

so dass wir 

{fu* Hz, y, ded X= — a Upi oe. ea ttn w)azax]| 


(7) 
er oi sz ot l dzdX — (Es n — E) ff u dzax] 
erhalten. Hier kann man das erste Glied rechts weglassen, dadurch 


wird die rechte Seite sicher nur vergrössert, und es folgt mit Hilfe der 
BESSELSchen Ungleichung 


E E dean 


Vif Eng ae 2 3X ae ae — E) ff u: dzdX. 


(8) 
ax 


Dies kann man mit (If u? Sn) kürzen, und die linke Seite 
folgendermassen abschätzen: [2 yl Hy dz ist eine Funktion der X, 


OX, 
das fiir irgendein Wertesystem dieser ein Maximum hat, das wir mit yf 


bezeichnen wollen. Ähnliches gilt von | (> a =) dx, doch können 


wir das zweite Glied in (8), da es klein gegenüber dem ersten ist — 
x, weist ja wesentlich stärkere Schwankungen auf, als p, — über- 
haupt weglassen [vgl. auch (*)]. Dann bleibt aus (8): 
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1) = 
ore aV [S| a3 32,1 ® > (Be m BE)V ff uf dxdX. (9) 
òy | : l 
Nun müssen wir =| | SA tax abschätzen. Zu diesem Zweck 
k | 


multiplizieren wir dar konjuge Komplexe von (6) mit y,y, und'inte- 

grieren wieder. Den kleinsten Wert von £,(X) bezeichnen wir mit E,» 

das zweite Glied von (6) gibt nach einer partiellen Integration eben 

das, was wir für (9) brauchen. Wegen (3a) fällt das dritte Glied ganz 

weg und auch das vierte können wir weglassen, weil es klein dem 

zweiten gegenüber ist [die Schwankungen von y, sind viel kleiner als 
die von yı, vgl. auch (*)]: 
2 '2 

ioe >43 aX + ffy y, Hu*dzdX <E— E; n. (10) 

Statt ff yıyıHu*dzdX können wir auch ffu*H y, y,ıdzdX schrei- 

ben, und dies haben wir schon in (7) berechnet. Das erste Glied der 


ersten Zeile von (7), das grösser als das zweite ist, ist wegen der BESSEL- 
schen Ungleichung kleiner als 


ffu* Hyny, dzdX< ay Ve za) [| S33 Or plz, ‘ae 


ò X, 0X; 
(11) 
he yy [dz | 
<gery VSl iw dxax idx V| gi. a Aak 


und kann daher neben dem ersten Glied von (10) ebenfalls weggelassen 
werden. Es bedeutet dann (10), dass die kinetische Energie der Kerne 
kleiner ist als der maximale Überschuss der Gesamtenergie E über die 
potentielle Energie. Setzt man (10) in (9) ein, so erhält man schliesslich 
4 h?y?! EB — En 
827° M (E> m — B) Eom — E` 
Der letzte Faktor ist in der Grössenordnung 1 (nämlich etwa 0'1), 

wir müssen daher nur den ersten Faktor, d.h. y, abschätzen. 


Wir wollen diese Abschätzung nicht ganz streng ausführen. Da 
in die Definition von y7 ausser h, e nur die Elektronenmasse eingeht, 


ff uP) de dX <y (12) 


: F 1... 
kann — abgesehen von einer numerischen Konstante — yı nur ——- sein, 


0 
wenn a, der Radius der ersten Bonrschen Bahn ist. Um y, doch 


etwas genauer abzuschätzen, kann man für y, (X, x) die erste Naherung 
einsetzen, die starren Atomen entspricht (die Näherung, die LONDON 


78 Part I. Physical Chemistry 


454 H. Pelzer und E. Wigner 


immer benutzt). Im Falle dreier Wasserstoffatome kann man dann 
die Integration leicht ausführen und erhält | 


yo, _ 3 rip Hs) _U 
Pes. a ia ag A 


In diesem Falle ist also schon das Integral (*) unabhängig von 
den Kernkoordinaten X, von den Abständen der drei Wasserstoff- 
atome voneinander. Der Maximalwert der Integrale (*) ist also gleich 


ihrem Wert. Durch Einsetzen von na in (11) erhält man 
0 


ff uw? dxdX<07-10~%. (**) 


Die Wahrscheinlichkeit der angeregten Zustände, der zweiten und 
aller höheren Energieflächen zusammen, ist immerfort kleiner als diese 
Zahl. Man kann weiterhin auch sagen, dass die Grössenordnung der 
Wahrscheinlichkeit einer Anregung wirklich durch (**) gegeben ist, 
also nicht sehr viel kleiner ist, da die Abschätzungen, die wir vor- 
genommen haben, nur verhältnismässig kleine numerische Faktoren 
bedingen können (etwa einen Faktor 1/5). 

So günstig, wie bei drei H-Atomen liegt die Sache natürlich nur 
selten, weil die Anregungsspannung nur selten so gross gegen die 
Aktivierungsenergie ist, wie hier. Deshalb ist ja eine Anregung in 
diesem Falle scheinbar überhaupt unmöglich und wird erst durch die 
Wechselwirkungsenergie von Elektronenbewegung und Kernbewegung 
ermöglicht. Andererseits ist natürlich bei anderen Stoffen M grösser 
als bei Wasserstoff, was die Wahrscheinlichkeit der angeregten Zu- 
stände wieder herabdrückt. 

Wir wissen jetzt, dass die Wahrscheinlichkeit der angeregten Zu- 
stände klein ist. Dies schliesst natürlich noch nicht aus, dass die 
angeregten Zustände die Entwicklung des unangeregten Zustands we- 
sentlich beeinflussen. Wir müssen daher noch untersuchen, wie gross 
der Fehler sein kann, den man macht, wenn man die Änderung von 7; 
anstatt mit (5) mit 


h oxy "x 0 y, 0 (5 
an Ir Dias dX? p, dæ- xi (5a) 


berechnet. In (5a) wurde zur ersten Zeile von (5) noch ein Glied 


h? oy u) 
u TALNA er 
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hinzugefügt, damit die weggelassenen Glieder x, nicht mehr enthalten 


sollen (wegen dxz=0). Das Zusatzglied, das als ein Zusatz- 


OY, 
aX, Yı 
potential aufgefasst werden kann, ist wie (*) zeigt, sehr klein, von 
der Grössenordnung von 0'02 Volt. 

Berechnet man die Änderung von 7° mit (5a), die von y, mit (5), 


d.h. setzt nach einiger Umformung 
h oy h? SAA h? oy, | 
RE WORT Da A E a 


du oY, d*y | 
mma? 9X, 0X, +usge|de, 
so wird sich x} von x, im Laufe der Zeit entfernen. Aus (5a) und (5b) 
folgt für die Differenz 


0 
yp J uaa dX = 


h du Au 0 

EEE 5 Yı Wilio \x 

DM DAE PAS a u) Zu atdi, 
wo das J bedeutet, dass vom darauffolgenden Ausdruck der Imaginär- 
teil genommen werden soll. Es folgt, wenn man das zweite Glied rechts 


als klein dem ersten gegenüber weglässt, mit Hilfe der BesseLschen 
Ungleichung: 


> ran Vis] lanax 
u / (a _. 2 Yı | , 


Nun folgt aus (7), wenn wir diesmal das zweite Glied rechts weg- 
lassen und für die linke Seite (11) einsetzen, und noch darin (12) und 
(10) berücksichtigen: 

' dxdX <7 


fos 
smm—)) |, n? M Ez ,,—E 


Die Wahrscheinlichkeit, dass y? ein von y, verschiedener Zustand 
sein soll, ist daher kleiner als 


(5b) 


(12) 


| du |? Ry E-Eim 5 


e hy? °” E — E, m 
O “dq 1 
ff | M1 X1 | dX < (- a) E: par g p (14) 


Diese Grösse wächst zwar nach hinreichend langer Zeit über alle 
Grenzen an, es genügt aber, für ¢ die Dauer eines Zusammenstosses 
20, __%VM 
ee N es ara 

Vv VE —E 1 m 
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(v ist die Geschwindigkeit) einzusetzen, nach Ablauf dieser Zeit werden 
sich ja die Reaktionsteile voneinander entfernt haben und es wird 
überhaupt keine Umsetzung mehr stattfinden können. Dass unsere 
Formel diesen. Tatbestand nicht wiedergibt, beruht darauf, dass wir 
unabhängig von ¢ mit den grösstmöglichen Werten der für die Störung 
massgebenden Grössen gerechnet haben. Wir können daher [mit Hilfe 
von (*)] für die Reaktion H + H, Z H+H 


9h? ] 1 


| Ex 2 en se eh E 
fixi X1 | dX < x?’ M . a? E,„— E 10 (t) 


schreiben. Dies ist 7 a z 
h? M 


—— 5; may, zu einer anderen Es „ 
8 2" mas ‘ 


schwerere Kerne noch kleiner und zeigt, dass man die Vorstellung von 
Energieflachen fiir die Atome sehr gut verwenden kann. 


Die Abschätzung (t) ist im Gegensatz zu der Abschätzung (**) 
eine sehr schlechte, weil sie gewissermassen annimmt, dass die Störung 
immer die günstigst mögliche Phase hat. Diese Annahme ist in dem 
Übergang von (12) zu (14) enthalten. Die Abweichung von yi von 
y, ist in Wirklichkeit wesentlich kleiner, als die Grenze in (14) es 
erlaubt, weil die Störungen im Laufe der Zeit sich nicht immer 
stärken, sondern zuweilen auch durch Interferenz schwächen werden. 
Wir wollen aber hierauf an dieser Stelle nicht eingehen und nur noch 
bemerken, dass noch ein Effekt in unserer Rechnung nicht ganz sinn- 
gemäss berücksichtigt worden ist. 


mal das Verhältnis einer atomaren Energie- 


grösse —E. Diese Zahl wäre für 


Nachdem sich nämlich die Reaktionspartner getrennt haben, 
können offenbar keine Elektronenanregungen mehr stattfinden und 
alle Partner müssen — da die Energie zu ihrer Anregung nicht hin- 
reicht — im Grundzustand sein. Die Glieder in (5), die die Übergänge 
-zwischen den verschiedenen Energieflächen bedeuten, verschwinden 
aber auch für weit getrennte Reaktionspartner nicht. Dies hat seine 
Ursache darin, dass die von uns als adiabatisch bezeichneten Bewe- 
gungen längs der Energieflächen solchen Bewegungen entsprechen, bei 
denen die ,,Mitbewegung des Kerns“ für die einzelnen Reaktions- 
partner nicht berücksichtigt ist. Bei Berücksichtigung der Mitbewe- 
gung ist ja die Wellenfunktion »,(X, z)x(MX + mx) nicht yı(X, 2)x(X). 
Solange die Atome nahe zueinander sind, kann man die Mitbewegung 
natürlich nicht abtrennen, weil man nicht weiss, welcher Kern von 
welchen Elektronen mitbewegt wird, und tatsächlich erzeugt die Mit- 
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bewegung während dieser Zeit Übergänge zwischen den Energieflächen. 
Wenn sich die Atome trennen, sollte aber die als adiabatisch be- 
zeichnete Bewegung automatisch in eine solche übergehen, bei der 
jeder Kern von seinen eigenen Elektronen mitbewegt wird. Doch 
scheint es nicht ganz einfach zu sein, dies zu erreichen. 

Natürlich ist mit dieser Rechnung über die Frage der Sprünge 
zwischen zwei Energieflächen nur sehr wenig ausgesagt. Erstens gibt 
es Fälle, in denen der Sprung in eine andere Energiefläche nicht nur 
ein häufiger Vorgang ist (ein Beispiel hierfür findet sich am Schluss 
dieser Arbeit), sondern sogar mit Sicherheit auftritt. Die unterste 
Energiefläche wird ja nur selten so weit getrennt von den übrigen 
Energieflächen liegen, wie in un- 
serem Beispiel, häufig werden 
sich die Energieflächen fast über- 
schneiden. 

Aber auch wenn unsere Vor- 
aussetzungen bezüglich der Lage 
der Energieflächen zutreffen, würde 
noch der Fall eine gesonderte Un- 
tersuchung erheischen, dass eine 
Umsetzung auf der untersten Fig. 3. 
Energiefläche sehr unwahrschein- 
lich ist. Dies kann z. B. vorkommen, wenn die Energie E zu klein 
ist zur Überschreitung des Potentialwalles, der der Reaktion im 
Wege steht. So ein Fall wird schematisch in Fig. 3 veranschaulicht. 

Man könnte denken, dass die Welle, die auf der zweiten Energie- 
fläche ist, ungehindert nach rechts hinüberlaufen kann, und dort 
wieder auf die untere Fläche hinunterfallen wird. Die Wahrscheinlich- 
keit einer Umsetzung auf diesem Wege wäre zwar im Sinne der voran- 
gehenden Ausführungen klein, da sie sich aber schon bei niedrigerer 
Energie abspielen könnte, als die normale Umsetzung, würde sie doch 
meistens vorherrschen, weil eine Herabdrückung der Aktivierungs- 
wärme die Geschwindigkeit der Reaktion viel mehr erhöht, als sie 
die Einführung eines sterischen Faktors erniedrigt. 

Wir haben die Rechnung für die zwei Energieflächen der Fig. 3 
unter der Voraussetzung ausgeführt, dass die beiden Integrale (*) 
konstant sind. Dabei hat sich gezeigt, dass die Welle auf der 
oberen Energiefläche ebenso zurückgeworfen wird, wie 
auf der unteren. Dies beruht darauf, dass an den Stellen, wo y, 


E,(4) 
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verschwindet, für x, eine Differentialgleichung mit einer Eigenwert- 
konstante E gilt, die kleiner ist, als alle Eigenwerte der Differential- 
gleichung (diese sind ja alle grösser als E,,,). Daher existieren zwei 
Lösungen, deren eine, die exponentiell ansteigende, natürlich aus- 
geschlossen ist. Die andere fällt aber exponentiell ab und verschwindet 
praktisch innerhalb 0'05 A. 


4. Wir wollen jetzt zu unserem eigentlichen Gegenstand über- 
gehen und die absolute Grösse (die sogenannte temperaturunabhängige 
Konstante) der Reaktion H + H,=H,+H berechnen. 

Anstatt die Änderung von x, mit (5a) zu berechnen, wollen wir 
klassisch rechnen, weil die klassische Rechnung viel leichter ist. Die 
klassische Rechnung wird — wegen der Nullpunktsenergie — eine 
unrichtige Aktivierungswärme geben, der sterische Faktor, für den 
einige Prozent Abweichungen nichts ausmachen, wird aber wohl nicht 
sehr falsch herauskommen. 

Wir wollen dabei von dem Eyrınag-PoLanviıschen Diagramm 
(Fig. 1) ausgehen. Dieses Diagramm gilt für dieReaktion 

Y+XZ=YX+2Z 

für den Fall, dass das freie Atom Y in der Richtung der Verbindungs- 
linie der beiden Molekülatome X und Z herankommt, weil dieser Weg 
die geringste Aktivierungsenergie benötigt. In den Diagrammen sind 
längs der Koordinatenachse die Abstände 6 und c der beiden äusseren 
Atome vom Mittelatom eingetragen und die Niveaulinien gleicher 
Energie W des Gesamtsystems stellen eine Sattelformation dar, die 
die beiden Täler des Anfangs- und Endzustands trennt; die Höhe des 
Sattelpunktes entspricht dabei (wenn man noch die Nullpunktsenergie 
berücksichtigt, so wie dies in der Eyrune-Ponanvischen Arbeit näher 
ausgeführt wird) der Aktivierungsenergie. Sie wurde dort für die drei 
Reaktionen H + H, (para)=H,+H, H+ HBr=H,+ Br und H+ Br, 
=H Br+ Br nach einem graphischen Verfahren bestimmt, wobei die 
Niveaulinien W berechnet wurden nach der Formel: 


W=D-Ver+B®+y—aß—Py—ya. (15) 

Hierin bedeutet D die um die Nullpunktsenergie vermehrte Disso- 

ziationsenergie des Ausgangszustands; die Wechselwirkungsenergien 
a (b+c); P(b); y(e) 

wurden nach der Morszschen Gleichung für die Potentialkurve zwei- 

atomiger Moleküle als Funktion der Abstände b, c, b+c berechnet. 
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Es könnte gegen die EykınG-PoLanvısche Betrachtungsweise Be- 
denken verursachen, dass nur Reaktionswege betrachtet werden, bei 
denen das dritte Atom in der Verbindungslinie der beiden Molekül- 
atome herankommt. Denn dies ist beispielsweise sicher nicht richtig 
für die Umwandlung von Parawasserstoff ın normalen, weil dabei der 
Drehimpuls sich ändern muss. Ausserdem könnte man befürchten, 
dass, wenn nur dieser Weg erlaubt wäre, zu selten ein Zusammenstoss 
eintritt, bei dem Reaktion möglich ist. Es ist mit eine Aufgabe vor- 
liegender Arbeit, sich von der Bedingung freizumachen, dass die drei 
Atome genau auf einer Geraden liegen müssen. 

Wir stellen uns nun also als nächste Frage die nach der Ge- 
schwindigkeitskonstante, nachdem in der Eykinc-PoLanvischen Ar- 
beit gezeigt wurde, wie die Aktivierungswärme zu berechnen ist. Oder 
anders ausgedrückt: Wie ändern sich quantitativ die Sattelverhält- 
nisse, wenn man auch Zusammenstösse berücksichtigt, bei denen die 
drei Atome nicht in einer Geraden liegen? 

Der Gedanke, der dieses Problem lösen soll, ist der folgende. Wir 
denken uns die Wechselwirkungsenergie in der Umgebung des Sattel- 
punktes entwickelt als Funktion der beiden Abstände 6 und c und 
des Winkels 6 zwischen 6 und c. Wenn wir nun auch diese Funktion 
dreier Variabler uns natürlich nicht als Fläche im dreidimensionalen 
Raum vorstellen können, so bedeutet sie gleichwohl eine Sattel- 
formation in einem Raum höherer Dimensionszahl und es wird auch 
bei ihr einen „Grat“ geben, der die beiden Talgebiete niedriger Energie 
voneinander trennt und dessen tiefster Punkt unser Sattelpunkt ist; 
bloss wird unser Grat keine eindimensionale Linie sein, wie bei den 
Gebirgen unserer dreidimensionalen Umwelt, sondern er wird ein 
Gebilde von zwei Dimensionen sein. Aber wir wollen unsere Über- 
legungen im folgenden so aussprechen, dass man bei der anschaulichen 
Vorstellung eines Gebirgssattels im gewöhnlichen Raum auch alles 
Wesentliche sehen kann. 

Um die Frage nach der Geschwindigkeitskonstante zu beant- 
worten — das ist die Frage nach der sekundlichen Zahl der Prozesse, 
bei denen das trennende Sattelgebiet in der Richtung vom Aus- 
gangstal nach dem Endtal überschritten wird —, stellen wir uns zu- 
nächst vor, wir hätten im Phasenraum des Bildpunktes Gleichgewicht, 
bei dem eben soviel Überschreitungen aus dem Ausgangstal in das 
Endtal vorkommen, wie umgekehrt aus dem Endtal in das Ausgangstal 
zurück. Da ist es nicht schwer, nach der BoLTzmAannschen Formel 
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für eine bestimmte Temperatur die Dichte der Bildpunkte in unserem 
Sattelgebiet und seiner Umgebung auf den Grathöhen anzugeben. 
Ferner können wir sagen, wie gross bei dieser Temperatur die Ge- 
schwindigkeiten unserer Bildpunkte sein werden, und zwar inter- 
essieren wir uns gerade für diejenige Geschwindigkeitskomponente, 
welche auf dem Grat orthogonal steht und aus dem einen Tal ins 
andere führt. Wir können sie über den Geschwindigkeitshalbraum, 
der aus dem Ausgangstal in das Endtal führt, mitteln, und da wir, 
wie gesagt, auch die Dichte der Bildpunkte überall auf dem Grat 
kennen, können wir sofort sagen, wieviele Bildpunkte pro Sekunde 
bei Gleichgewicht aus dem Ausgangstal in das Endtal hinüberwandern. 
Die Dichte der Bildpunkte.und damit der Überschreitungen nimmt 
nach der BoLTzmann-Formel sehr rasch (exponentiell) ab, wenn wir 
aus dem Sattelpunkt den Grat hinaufklettern, so dass man sie sehr 
bequem über das gesamte trennende Gebiet der Grathöhen weg- 
integrieren kann, um die Zahl der Überschreitungen festzustellen. 
Daher werden wir uns (wegen dieser exponentiellen Abnahme der 
Dichte) bei unserer Entwicklung der Energie nach den Abständen in 
der Umgebung des Sattelpunktes mit den quadratischen Gliedern be- 
gnügen, was die Rechnung sehr erleichtert. Die Zahl der so be- 
rechneten Übergangsprozesse beim Gleichgewicht ist aber bekanntlich 
dieselbe, wie wenn die Rückreaktion fehlen würde, weil das End- 
produkt noch nicht vorhanden ist. Mithin haben wir die Zahl der 
Übergangsprozesse berechnet, womit die Frage nach der Geschwindig- 
keitskonstante beantwortet ist. 

Was wir in dieser Weise berechnen, ist streng genommen die 
Anzahl der sekundlichen Gratüberschreitungen im Gleichgewicht, wäh- 
rend wir uns sehr weit vom Gleichgewicht befinden. Das Tal IT ist 
ja ganz leer, obwohl es im Gleichgewicht ebenso viele Bildpunkte hat, 
wie das Tal I. Natürlich ist die Zahl der Gratüberschreitungen in 
Richtung des Tals II unabhängig von der Besetzungsdichte dieses, 
und hängt nur von der Besetzungsdichte des Tals I ab, wo wir Gleich- 
gewicht annehmen wollen. Wir rechnen aber so, als ob überhaupt 
keine Gratüberschreitungen in umgekehrter Richtung in das Tall 
vorkommen würden. Dies ist, obwohl das Tal II anfangs ganz leer 
ist, nicht ganz berechtigt, weil es im allgemeinen möglich ist, dass 
ein Bildpunkt, der den Grat eben nach Tal II überschritten hat, von 
dort sofort in das Tal I zurückgeworfen wird. Einen solchen Fall 
würden wir als Reaktion mitrechnen, weil wir ja nur die Zahl aller 
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Gratüberschreitungen nach Tal II berechnen können. Ein Blick auf 
die Fig. 1 lehrt aber, dass solche Reflexionen des Bildpunktes kaum 
vorkommen können. — 


Wir haben nun den vorher ausgeführten Gedanken durchzuführen. 
Wir wollen es an dem Beispiel der Umwandlung des Parawasserstoffs 
in normalen ausführen, für den A. FARKAS!) den Mechanismus 


H + H, (para) —= H, (normal) + H 


angegeben hat. Bei der Rechnung wollen wir nur die Wahrscheinlich- 
keit der Umsetzung eines Moleküls H, mit einem Atom H berechnen, 
da die Reaktionsgeschwindigkeit im Falle mehrerer Teilchen doch 
proportional zur Konzentration ist. Der erste Schritt hierbei ist die 
Entwicklung der Energie in der Umgebung des Sattels. 

Es gilt jetzt nicht mehr a=b+ c, da wir uns ja gerade von der 
Voraussetzung, dass die Atome in geradliniger Anordnung reagieren, 
freimachen wollen. Nach dem Cosinussatz haben wir vielmehr: 


a?=b°--c?+2bc cos ô. 


Am Orte des Sattelpunktes sei b5=c=b, und ö=0. In der Um- 
gebung dieses Punktes haben wir dann die Lage gegeben durch die 
Zi b=b, +B; c=e+C; ô, (16) 
wobei wir nach B, C, 6 entwickeln wollen und uns mit Gliedern zweiter 
Ordnung begniigen. Dann wird 


= ol s 
a=b+B+¢+C Cra 
Und in der zu entwickelnden Funktion 
W = D—Va?+ f?+ y*— af — py —ya l 


ANE ee ee A 
= D- Vf +0 FAO —fOFO—fOF@, ) | 


wobei f(r) die Morsesche Funktion für das Wasserstoffatom bedeutet: 
f(r) = D(e-2kr- m — 2e-kr—n), 


D =Dissoziationsenergie*) + Nullpunktsenergie, r,—0'75 Ä=nor- 
maler Kernabstand, k=2'09 Ä-! ergibt sich aus der Grösse des Grund- 


1) A. Farxas, Z. physikal. Ch. (B) 10, 419. 1930. 2) Der in der EYRING- 
Potanyvischen Arbeit auf S. 286 angegebene Wert 39132=111 kcal ist zu ersetzen 
durch 37000= 107 keal, wie es übrigens auch den Rechnungen in dieser Arbeit 
richtig zugrunde gelegt wurde. 
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schwingungsquants (in der Tabelle auf S. 286 der Arbeit von EYRING 
und Poranyı gilt die Angabe cm”! für die letzte Spalte natürlich 
nicht mehr, k ist in Ä-! ausgedrückt). 


Wir haben W um die Stelle B=Ü=ö-=0 zu entwickeln; da an 
dieser Stelle ein Sattelpunkt ist, verschwinden dort die ersten Diffe- 
rentialquotienten und unsere Entwicklung beginnt mit den quadrati- 
schen Gliedern, mit welchen wir uns auch begnügen: 


Lew. 
Wb, +B, c,+ 0,6) = W(b,. e436 = 0) +; |» B 
YW, YW) PW 
+52 t ag | 


(Die gemischten Differentialquotienten nach 6 und einem der Ab- 
stände B und C fallen fort.) Ferner ergibt sich in dem von uns be- 
trachteten Falle dreier Wasserstoffatome 
die weitere Vereinfachung, dass b,=c, ist ?), 
und man daher nach dem Differenzieren 
p =y setzen kann, wodurch der Wurzelaus- 
druck den einfachen Wert a—f bekommt. 

Wir haben nun die Differentialquotien- 
ten zu berechnen und sie in (3) einzusetzen. 
Man erhält so 


W=W,+D[065(B?+C°) | 


Fig. 5. Transformation von +149BC+0°096?]. f (18) 

den Grössen b, c auf B, C und , | , p 

schliesslich auf Hauptachsen- Dass die quadratische Form in der 
koordinaten $, 7. eckigen Klammer bezüglich B und C wirk- 


1) In der Arbeit von Eyrına und PoLanyI ist der auf S.290 angegebene 
Wert 0°95 A für den Abstand b,=c, zu ersetzen durch den Wert 0°91 A, wie es ja 
auch aus der dortigen Fig. 7 auf S. 291 zu entnehmen ist. 
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lich einen Sattel bedeutet und nicht etwa eine Mulde, sieht man, 
wenn man durch die Substitution 


c- f= | a 
V2 


auf Hauptachsen transformiert. Schreiben wir nämlich unsere quadra- 
tische Form in (18) allgemein: 


W= W, +1(B?+C°)+mBÜC-+u6, (18a) 
so wird durch die Substitution (5) daraus: 


W=W.+ (1+ F)e—(F - +46 = pé — qr uô; (18b) 
aa m=149D>21=13D, so bedeutet dies tatsächlich Hyperbeln 
als Niveaulinien und & bedeutet die Gratlinie, 7 die Talverbindungs- 


linie. 


Schon jetzt können wir den sterischen Faktor abschätzen. Wir 
sehen aus (18), dass der Koeffizient von 62 etwa die Grösse u= 2 
besitzt. Nähert sich das freie Atom genau in der Richtung der Kern- 
verbindungslinie, so muss es die Aktivierungsenergie W, besitzen, um 
reagieren zu können; kommt es aber in einen Winkel 6 gegen die 


Kernverbindungslinie heran, so muss es also um 2 ô? mehr Energie 


besitzen, um erfolgreich zu sein. Nun ist aber allgemein bei zwei 
quadratischen Freiheitsgraden die mittlere Überschreitung einer be- 
liebig vorgegebenen Energie Wọ=RT1). Wir fragen nun einfach, 


1) Man sieht das ein, wenn man bedenkt, dass der Schwerpunkt der unter 
€ 


der Kurve e £T und der e-Achse liegenden Fläche immer im Abstand RT von 
der sie linksseitig begrenzenden Abszisse entfernt liegt (vgl. Fig. 6): 


fe ‚de HY 
2 
y= Pe — aW, + RT + Glieder mit a usw. 
(ke TT ; 
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welchen Bruchteil der Einheitskugel diese mittlere Überschreitung 
bedeckt. Dazu muss also seın 2 ô? = RT, oder mit D =1'07 -105 cal; 


T ~ 300° abs. (entsprechend den Temperaturen bei GEIB und HARTECE) 
ô? a 0°05. 


N 


Bis zu diesem Winkel werden also im Mittel erfolgreiche Stösse 


vorkommen; nun verhält sich die in der Fig. 7 gezeichnete Calotte 
2 
zur ganzen Kugeloberfläche wie die zugehörige Höhe = zum ganzen 


o and 


Durchmesser 2; sie erfüllt also den Bruchteil d = 5 der ganzen Ober- 
fläche; da das Atom aber auch noch auf der anderen Seite heran- 
kommen kann, ist der sterische Faktor doppelt so gross, also ~ a 


Experimentell finden FARKAS!) sowie GEIB und HaRTEcK?) etwa das 
Fiinffache. 


Berechnung der Dichte im Phasenraum. 


Solange die drei Wasserstoffatome als unabhangig betrachtet 
werden, und wir ihnen die kartesischen Koordinaten zuteilen: 


%, Yı 2,| sind im Abstand b voneinander 
Mittelatom: x, Y, Z,Í (ursprüngliches Molekül) 
ursprünglich freies Atom: x, Y; 23, 


ist das Volumenelement des Konfigurationsraums einfach gegeben 


ounen dz,dy,...a2,5 


1) Farkas, Z. physikal. Ch., loc. cit. 2) GEIB und HARTECK, Z. physikal. 
Ch., BoDENSTEIN-Festband, 849. 1931. 
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und die Zahl der in ihm enthaltenen Bildpunkte ist ihm proportional. 

Um aber die Wechselwirkungsenergie bequemer berücksichtigen zu 

können, müssen wir auf neun andere Koordinaten transformieren, 
und wir wählen als solche 

1. Wie oben die drei kartesischen Koordinaten des Mittelatoms 2: 

t, =X 

(19a) 


2. Für das Atom 1, das mit 2 zusammen das Ausgangsmolekül 
bildet, Polarkoordinaten b, 9 und » mit Atom 2 als Ursprung: 
£, =X — b sin ® cos 
Y, = Y — bsin Ŷ sin g 
z, =Z —b cosè. 
3. Für das Atom 3 Polarkoordinaten c, 6 und y, ebenfalls mit 2 
als Ursprung, aber mit der Molekülachse als Polarachse: 


(19b) 


x, =X +c sin ô cosy cos? C0oSP— csinösinysinp+ccosösind cosy 
Ys = Y+ csin ô cos ycosð sinp+csinösinycosp-+ccosösin® sing 


(19 c) 
2, =Z—csinöcosysin® +ccosöcos®. | 


Die Funktionaldeterminante ergibt sich nach einiger Rechnung zu 


= Ò (%,Y,2, Ly Yı 2, LzYys2z) _ 2. 2 
F= aA FZbpco sin? c’ sind. 


Das Volumenelement in den neuen Koordinaten ist also: 
dv=b:c?dbdc sin 6d6 sin ddbdydgdXdYdZ. 


Wir denken uns nun ein Molekiil und ein Atom eingeschlossen 
in einen Kasten vom Volumen V. Dann hat der Konfigurationsraum 
neun Dimensionen: X, Y, Z, b, 3, p, ¢, 6, y. Um die Wahrscheinlich- 
keit dafiir zu berechnen, dass der Bildpunkt sich eben auf dem Grat 
befindet, müssen wir den allgemeinen Ausdruck für die Wahrschein- 
lichkeit: os 

conste ¥T be dbdcsind dé sind dd dydgdXdYdZ (20) 


über einen Streifen von der Breite An längs des Grates integrieren. 
Die Const bestimmt sich daraus, dass das Integral über den ganzen 
Konfigurationsraum gleich Eins ist. Da der Ausdruck (20) überall 
viel kleiner ist, als in der Nähe der Stelle b=r,, was dem normalen 
Abstand der beiden H-Atome im Molekül entspricht, brauchen wir 
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bei der Bestimmung der Const nur über diese Gegend zu integrieren. 
Dabei können wir W =s(b—r,)? setzen, wo s die elastische Konstante 
des H,-Molekiils ist. Die Integration über X, Y, Z und c, 6, y ergibt V?, 
die m 8, y den Faktor 4rz. Wir erhalten daher 


"= _3b— 70)? 
const 4 u V? fe kT b db=1. 


0 
Hier können wir für b°db=r5db schreiben und erhalten 
E | 
4naV? rè! nkT 

Dies müssen wir in (20) einsetzen. Um die Integration längs des 
Grates ausführen zu können, führen wir nach (19) wieder 


ee Ne 
V2 v2 
ein und erhalten für die Anzahl der Bildpunkte auf den beiden Graten 
(der eine entspricht der Annäherung des freien Atoms von der Seite 


des Atoms 1, der andere einer Annäherung von der Seite des Atoms 2 
an das Fa 


const = 


Nee kT b? 
TR zn o c"d£sinödösinddddwapdXdYaZ. (21) 


Hier ist wieder An die Breite des Gebiets, das wir noch zum 
Grat rechnen. Für W muss hierbei die Entwicklung (18b) eingesetzt 
werden, die in der Nähe der 
en. ETERS S tiefsten Stelle des Grates gilt. 
N ey E ne, Die höheren Stellen des Grates 
i. ale Loan sind ja so unwahrscheinlich, 
dass wir sie ausser acht lassen 
Grat HEE ¿,ġ Können, die überwiegende Mehr- 
zahl der Bildpunkte des Grates 
er = ist in der Nähe der Stelle 
E : J: EA T N $ = b=c w by, 6=0, wenn b, der 
| RRRA ON Abstand der seitlichen Atome 
| Ausgangstal vom Mittelatom für die Sattel- 

Fig. 8. Die Umgebung des Sattels. Nach a 
hie und links, gegen die beiden Grathöhen Die Integration über X, 


zu, nimmt die Besetzung mit Bildpunkten Y, Z, 9, op gibt 4 nV, die 
sehr rasch ab. über y gibt 22, für sind 


. 
o* » 
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schreiben wir ô und erhalten so (längs des Grates ist 70) mit 
Hilfe von (18b) 


Wo (ps?+ ud?) 
_ 20-20 T- 2a:2a V Any e bte ET |e ET ôdëdô. 


nx V’ rè ıkT 
pe We Wo 
_ Am Any) s kT m. (22) 
Vrè p u 


Dies ist die Zahl der Bildpunkte auf dem Grat. Ein Bildpunkt, 
der die Geschwindigkeit n senkrecht zum Grat besitzt, braucht die 


Zeit ze =t zum Durchlaufen des Gratgebiets. Ist der Bruchteil der 


Bildpunkte mit der Geschwindigkeitskomponente senkrecht zum Grat 
zwischen 7 und n+dn gleich w(n)dn, so en aus diesem Ge- 


schwindigkeitsbereich pro Sekunde nw (n) — ar den Grat. Die gesamte 
Reaktionsgeschwindigkeit ist daher 


3 = Wo 
BR w(n)d = ey U, og kr 
An nw(n)dn Zaire Vr: ; (23) 
wo für [nw(n)dn sein Wert eingesetzt wurde (vgl. den Anhang). 

Das ist die relative Zahl der H,-Moleküle, die pro H-Atom im 
Volumen V pro Sekunde umgesetzt werden. 


5. Aus den Versuchen von A. FARKAS und GEI und HARTECK 
ergibt sich folgende Tabelle für die Geschwindigkeitskonstante k in 


wo p für den Augenblick die Konzentration des Parawasserstoffs und c 
die der Wasserstoffatome in Mol/Liter bedeutet. 


| k | 
T j 3 C 
3 | In Te | 
283 86-104 | 29 2.106 
373 27.106 59 5.106 
873 12.109 107 2.106 
1023 1°9- 109 11°0 1°5-106 


Aus den Zahlen der 3. Spalte erhält man für die Aktivierungs- 
wärme W, Werte zwischen 6300 und 7100 cal. Das ist etwas weniger, 
als GEIB und Harreck erhalten, weil sie eine Formel verwenden, 
die T’ enthält an Stelle unseres T°, doch sei erwähnt, dass die 
Aktivierungswärme, wie wir sie berechnen, mehr streut, als nach 
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GEIB und HARTEcKs Berechnungsweise. Mit Hilfe emer Aktivierungs- 
wärme von 6600 cal erhält man dann die Werte der letzten Spalte 


für die Konstante C in der Formel 
Wo 


| k,=CT’he RT, 
Aus (23) ergibt sich dagegen 


4 mbt V: kil 3 
a 2 20 "9 ha —— 
C= 6.109. hey (23a) 
oder mit b,=0'91:10”® cm, r,=0'75 :10”® cm, 
M =1'6 -10-*4g, V= =1'75 


C =?1 - 10°. 


Die gute Übereinstimmung zwischen beobachtetem und berech- 
netem C wird man zweifellos zum grossen Teil einem Zufall zu- 
schreiben, wenn man die vielen Fehlerquellen in Betracht zieht. 

Als Fehlerquellen in unserer Rechnung kommen drei Ursachen in 
Frage. Erstens die Unsicherheit im Energieausdruck (18b) betreffs der 
Konstanten b,, u, p. Doch zeigt (23a), dass nur die erste Grösse sehr 
wesentlich ins Gewicht fällt, und diese ist wohl richtig bestimmt: sie 
streut bei allen Rechnungen von EYRING und POLANYI nur sehr wenig, 
obwohl diese Rechnungen unter sehr verschiedenen Annahmen durch- 
geführt worden sind. Nimmt man an, dass die Aktivierungswärme 
wirklich so klein ist, wie es die von FARKAS angegebene erste Zahl 
andeutet, so würde die Energiefläche wohl noch flacher sein, als von 
uns angenommen, und dies würde C in der falschen Richtung ändern. 
Es scheint daher, dass die grössere Aktivierungswärme wahrschein- 
licher ist. 

Als zweite Ursache kommt in Frage, dass Bildpunkte, die den 
Grat schon überschritten haben, wieder in das erste Tal zurück- 
geworfen werden. Dies wird durch die Fig. 12 von Eyrine und 
PoLanyı nahegelegt, in der zwei Grate sind, die von einer Mulde 
(einer instabilen Verbindung H, entsprechend) getrennt sind. Ist so 
eine Mulde vorhanden, so muss jeder Bildpunkt zwei Grate über- 
schreiten, und ein Bildpunkt, der den ersten Grat bereits überschritten 
hat, kann auf eine ungünstige Stelle des zweiten stossen und von 
dort in die Mulde zurückgeworfen werden. Er wird dann etwas in 
der Mulde herumrollen und diese schliesslich durch einen der Grate 
verlassen. Im ungünstigsten Falle — bei sehr tiefer Temperatur — 


u D 
a ne SEE e 
ER en 
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würde der Bildpunkt die Mulde durch beide Grate mit gleicher Wahr- 
scheinlichkeit verlassen, und dies würde einen Faktor !/, für (23a) 
bedeuten. Wahrscheinlich ist aber bei 1000° dieser Faktor schon 
nahezu Eins. 

Als dritte Fehlerquelle ist schliesslich in Betracht zu ziehen, dass 
die Berechnung der Bewegung des Bildpunktes mit der klassischen 
Theorie ausgeführt worden ist. Das ist wohl im Falle von Wasser- 
stoffatomen nicht berechtigt, wie schon daraus hervorgeht, dass die 
Nullpunktsenergie von H,, ein Quanteneffekt (!/),;hv=6 kcal) in der 
Grössenordnung der ganzen Aktivierungswärme ist. Wie gross aller- 
dings der hierdurch verursachte Fehler ist, lässt sich schwer angeben. 
Bei allen anderen Reaktionen, die sich nicht unter H-Atomen ab- 
spielen, wäre er sicher kleiner. Wir hoffen hierauf beı anderer Ge- 
legenheit noch einmal zurückzukommen. 

Wir wollen noch unsere Gleichung (23a) für die Geschwindigkeits- 
konstante vergleichen mit der sich aus der einfachen kinetischen Gas- 
theorie sich ergebenden Stosszahl?!), die wir mit k bezeichnen wollen, 
ro = Radius von H., rı = Radius von H: 


ko = 47-6109. (77+ 75)? a a (24) 
Es ergibt sich: 
CT’ V 268 V2 ekT 
a = (25) 


ko itr pou. 

Obwohl man denken sollte, dass aus der Loxpoxschen Theorie 
eine sehr niedrige wirksame Stesszahl folgt, weil die Reaktion eine 
sehr spezielle Konfiguration, nämlich die gerade Konfiguration ver- 


langt, ist dies nicht der Fall. Es tritt zwar ein Faktor auf, der 


nach der Überlegung auf S. 463 bis 464 einem sterischen Faktor ent- 


spricht und bei niedriger Temperatur besonders klein ist. Dagegen tritt 
4 

auch —— n 

(ri + 7%) ro 


gestrecktes Molekül, der Nenner auf ein normales Molekül bezieht. 


auf, wo sich der Zähler auf ein durch die Reaktion 


Ausserdem tritt Ve auf, das grösser als Eins ist, dem Umstand ent- 


sprechend, dass die Krümmung der Potentialfläche längs des Grates 
kleiner als im Normalzustand des Moleküls ist. Daher kommt für 


1) Vgl. K. F. HERZFELD, Kinetische Theorie der Wärme. 
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die Bildpunkte ein längerer Teil des Grates in Frage, als der Abstand 
der Atome im Molekül variieren kann, und dies erhöht die statistische 
Wahrscheinlichkeit des Grates. Zudem ist die Aktivierungswärme, 
wenn man sie nach der Formel 


g= RT L m. 
dT VT 
berechnet, um RT höher als W,. Berechnet man nun aus „lie BF 
zurückzu das C’, so wird sie das e-fache von CT”. Daher tritt bei dem 
Vergleich des in dieser Weise berechneten ,,temperaturunabhangigen 
Faktors“ C’ mit der Stosszahl zu (25) noch ein Faktor e hinzu. 


Eine halbe Potenz der Temperatur tritt in (23a) [wie auch in (24)] 
auf, weil die Anzahl der Zusammenstösse proportional zur Geschwindig- 
keit der Moleküle ist. Weitere zwei halbe Potenzen rühren daher, 
dass für die Reaktion zwei Bedingungen mehr (b=b,, c=c, 6=0) 
erfüllt sein müssen, als für den Normalzustand (b=7,). 

6. Wir haben bisher so gerechnet, wie wenn durch die Umsetzung 
normaler H, entstehen würde, d.h. Ortho- und Parawasserstoff im 
Verhältnis der statistischen Gewichte 3: 1. Dies erscheint angesichts 
der Tatsache, dass die vorhandene Energie gross gegen den Abstand 
der Ortho- und Parazustände ist, von vornherein sehr wahrscheinlich. 
Wir wollen es aber noch genauer begründen. 


Stellen wir uns ein Parawasserstoffmolekül vor, so sind die beiden 
Kernspins sicher antiparallel und jedes hat mit der Wahrscheinlich- 


keit 1/, die +Z- und mit der Wahr- 
CED | | QŒ D scheinlichkeit 1/,die —Z-Richtung. 


AS Betrachten wir nun einen Zusam- 

GD $ —> A (== menstoss, bei dem der Kernspin 
des freien Atoms die +Z-Richtung 

——) hat, dies ist bei der Hälfte aller 

QD | as | ER Zusammenstösse der Fall. Nun 
wird sich hieran im Laufe 

Ct) | —> t «HD des Zusammenstosses nichts 
Fig. 9. ändern, weil die Zeit eines 
Kernaustausches zu gross 

ist: Er ist in der Grössenordnung der Periode der einquantigen Ro- 
tation, während die Zeit des Zusammenstosses der Periode einer sehr 


vielquantigen Rotation entspricht. Die Atome, die nach dem Zu- 
sammenstoss das Molekül bilden werden, haben also mit der Wahr- 
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scheinlichkeit 1/, einen gesamten Kernspin +1 in der Z-Achse. Tritt 
dies ein, so ıst das entstandene Molekül sicher im ,,Kerntriplett‘‘- 
Zustand, ein Orthowasserstoffmolekül mit der gesamten Z-Spinkom- 
ponente ?/} Die Wahrscheinlichkeit, dass bei einem Zusammenstoss 
dieser Zustand entsteht, ist also !/,. Da aber die anderen Ortho- 
wasserstoffzustände mit derselben Wahrscheinlichkeit entstehen müs- 
sen, ist die Gesamtwahrscheinlichkeit für das Entstehen eines Ortho- 
wasserstoffmoleküls */,. Insbesondere zeigt es sich, dass bei dem Zu- 
sammenstoss, bei dem die Spinkomponente des entstehenden Moleküls 
in der Z-Achse Null ist (zweite und dritte Zeile der Fig. 9) mit gleicher 
Wahrscheinlichkeit ein Ortho- und ein Para-molekül entsteht. 


Anhang. 


Den Mittelwert von 7 berechnet man wohl am einfachsten, indem 
man sich daran erinnert, dass 


(B-C) _ 6-0 


n=- 


V? v2 

ist, wo b und c die Abstände des ersten und dritten Atoms vom zweiten 

(6 — ec) 
2 


Atom sind. Wegen der Geradlinigkeit der Konfiguration ist aber 


der Abstand des Schwerpunktes der Atome 1 und 3 vom Atom 2, so 
dass 7) das Y2-fache der Relativgeschwindigkeit des Atoms 2 gegen 
den Schwerpunkt der Atome 1 und 3 ist. Die Wahrscheinlichkeit einer 
Relativgeschwindigkeit v3 ist aber 


-1 _ Mat n 
e kT 2U+M) 2 
2 


wo M die Masse des zweiten Atoms ist und für M’ die Summe der 
Massen der Atome 1 und 3, d.h. 2M eingesetzt werden muss. Wir 
erhalten somit 1 i 

w(n) = const-e FT 6, 
V6kT 
Va 
muss noch mit 2 dividiert werden, weil nur die Halfte der Bildpunkte 
die richtige Geschwindigkeitsrichtung hat. 


und hieraus berechnet sich der Mittelwert von h zu - Dies 


Die Arbeit wurde ausgeführt im Kaiser Wilhelm-Institut für 
physikalische Chemie in Berlin-Dahlem. Wir möchten es nicht ver- 
säumen, Herrn Prof. POLANYI auch an dieser Stelle für viele wertvolle 
Besprechungen herzlichst zu danken. 


Über das Überschreiten von Potentialschwellen 
bei chemischen Reaktionen 


E.P. Wigner 


Zeitschrift für physikalische Chemie B 19, 203-216 (1932) 


(Eingegangen am 30. 7. 32.) 


Es wird die Durchlässigkeit einer Potentialschwelle gegenüber einem Atum- 
strom mit MaxwEuuscher Geschwindigkeitsverteilung berechnet für den Fall, dass 
es erlaubt ist, die Quantenkorrektion nur bis zu Gliedern mit der zweiten Potenz 
der PLanckschen Konstante zu berücksichtigen. Das Resultat wird mit Eckarts 
exakter Berechnung der Durchlässigkeit einer bestimmten Potentialschwelle ver- 
glichen und auf die Umwandlungsgeschwindigkeit von Parawasserstoff in normalen 
Wasserstoff angewandt. 


1. Es wurde schon wiederholt auf die Möglichkeit hingewiesen, 
dass das unmechanische Überschreiten von Potentialschwellen bei 
chemischen Reaktionen eine Rolle spielt!). Die meisten chemischen 
Reaktionen bestehen ja darin, dass ein Atom eine Potentialschwelle 
überschreitet und so eine neue Konfiguration des Systems herbeiführt. 
Die Höhe der Schwelle äussert sich als Aktivierungswärme. 


Es sind hierbei drei Fälle zu unterscheiden: Entweder ist die 
Schwelle so dick, dass die Atome sie praktisch gar nicht durchdringen 
können — dies wird gewöhnlich stillschweigend angenommen —, dann 
verläuft die Reaktion klassisch. Wenn die Schwelle sehr dünn und 
hoch, die Temperatur sehr niedrig ist, so tragen hauptsächlich die 
Atome mit geringen Geschwindigkeiten zur Reaktion bei. Es ist wahr- 
scheinlich, dass für keine bis jetzt bekannte Reaktion dieser Fall vor- 
liegt, sie würde sich durch nur geringe Temperaturabhängigkeit der 
Reaktionsgeschwindigkeit bemerkbar machen. Bei genügend tiefer 
Temperatur muss zwar dieser Mechanismus immer der ausschlag- 
gebende sein, aber diese Temperatur ist in den allermeisten Fällen so 
niedrig und die zugehörige Reaktionsgeschwindigkeit so klein, dass sie 


1) F.Hunv, Z. Physik 43, 805. 1927. J. R. OPPENHEIMER, Physic. Rev. 31, 66. 
1928. Bourain, Pr. Nat. Acad. Washington 15, 357. 1929. R.M. LANGER, Physic. 
Rev. 34, 92. 1929. M. Born und J. Franck, Nachr. Götting. Ges. 1930, 77. S. Ro- 
GINSKI und L. ROSENKEWITSCH, Z. physikal. Ch. (B) 10, 47. 1930. 15, 103. 1931 
M. Born und V. WEISSKoPF, Z. physikal. Ch. (B) 12, 206. 1931. 
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praktisch überhaupt keine Rolle spielt. Der dritte Fall, den wir zu 
unterscheiden haben, ist schliesslich der, dass die reagierenden Mole- 
küle fast genug Energie haben, um die Schwelle mechanisch über- 
schreiten zu können, dass aber ein endlicher Prozentsatz unter ihnen 
doch eine etwas niedrigere Energie hat und durch den obersten Teil 
des Berges. unmechanisch hindurchgeht. Dieser Effekt wird nur eine 
Korrektion für die klassisch berechnete Reaktionsgeschwindigkeit be- 
deuten, wegen ihrer prinzipiellen Bedeutung soll sie aber im folgenden 
doch ausführlich behandelt werden. 

Schon C. EckArr!) hat darauf hingewiesen, dass man in diesem 
Falle die bekannte WENTZEL-BRILLOUINsche Methode zur Berechnung 
der Wahrscheinlichkeit der Schwellenüberschreitung nicht benutzen 
kann. Er stellte jedoch ein Potential auf, für das man die ScHRÖ- 
DINGER-Gleichung für jede Energie exakt lösen kann. Wir brauchen 
viel weniger, nämlich die Durchdringungswahrscheinlichkeit nur für 
Geschwindigkeiten, die nahezu gleich der Geschwindigkeit sind, bei 
der das Überschreiten der Schwelle mechanisch eben noch möglich ist. 
Dagegen wollen wir diese Wahrscheinlichkeit für eine beliebige Po- 
tentialschwelle berechnen. Am Schluss wollen wir unser Resultat auf 
die ECKARTsche Potentialschwelle anwenden und die Resultate ver- 
gleichen. | 

2. Wir denken uns eine Potentialschwelle V (x) und einen Strom 
von Atomen, der von links (negative x) im Temperaturgleichgewicht 
auf die Schwelle zuströmt und fragen nach der Zahl derjenigen Atome, 
die reflektiert und die durchgelassen werden. Da wir es mit einem 
System zu tun haben, das im wesentlichen der klassischen Mechanik 
gehorchen soll, und bei dem die Quanteneffekte eine nur geringere 
Rolle spielen sollen, können wir versuchen, mit der Wahrscheinlich- 
keitsfunktion P(x, p) zu rechnen. Diese Wahrscheinlichkeitsfunktion 
entsteht aus der statistischen Matrix?) U (x, x’) des Systems durch die 


Transformation?) 2 Bine 


P(&,p)= |Ule+yx-y)e * dy (1) 


— € 


und ihre Veränderung mit der Zeit ist durch 


0Pwp) poP VOP eat (Ss) a ò yV òP 


E EI aiae ap + (2) aios ap O 


~ m Ox | Ox Op 


1) C. EcKART, Physic. Rev. 35, 1303. 1930. 2) L. Lanpav, Z. Physik 45, 
430. 1927. H. WEYL, Z. Physik 46, 1. 1927. J.v. Neumann, Nachr. Götting. Ges. 
246, 245. 1927 . 3) E. WIGNER, Physic. Rev. 40, 749. 1932. 
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gegeben (h ist dabei die PLancksche Konstante dividiert durch 22). 
Wir interessieren uns für eine Lösung dieser Gleichung, die stationär 
ist, für sehr grosse negative x und 

positive p gleich e”???” wird, für 

sehr grosse positive x und nega- 

tive p dagegen Null ist. Die erste V(x) 
Bedingung bedeutet, dass wir es | | 
mit einem stationären Zustand zu 
tun haben, die zweite, dass die auf 
die Schwelle zuströmenden Atome 
im Temperaturgleichgewicht sind 


undihre Temperatur T= nn ist, die 
I 


8 


dritte Bedingung besagt schliess- 

lich, dass von rechts keine Atome 

auf die Schwelle zuströmen. Für 

den klassischen Fall sind die Strö- 

mungslinien der Atome im 2, p- 

Raum (Phasenraum) in Fig. 1 ein- Fig. 1. 
gezeichnet. 

Eigentlich interessieren wir uns nicht für das ganze Strömungs- 


bild, sondern nur für den Gesamtstrom | pP(z,p)dp für sehr grosse x. 

A | 
Es liegt daher nahe, (2) von 0 bis co zu integrieren (die linke Seite 
kann gleich Null gesetzt werden) 


>." 1 geet V (2) d?” P(x, 0) 


DN a a u 3 
< 237 (2n +1)! drr tl op?” ( ) 


lar 
E 35) p P(x, p)dp = 
0 7 
Diese Formel erlaubt es uns, den Strom mit Hilfe der Werte der 
Wahrscheinlichkeitsfunktion an der Stelle p= 0 zu berechnen. 


Da ein unmechanisches Überschreiten der Schwelle nur für Ge- 
schwindigkeiten in Frage kommt, die ganz in der Nähe der kritischen 


Geschwindigkeit v = liegen (V, ist die Höhe der Schwelle), wird 


die Wahrscheinlichkeitsfunktion, abgesehen von der Umgebung der 


Stellen ka + V(x) = V, genau so wie im vollständigen thermodynami- 


schen Gleichgewicht aussehen. Für P(e, p)in der Mitte des schraffierten 


On Crossing Over Potential Barriers in Chemical Reactions 99 


206 E. Wigner 


Teils der Fig. 1 können wir daher die Werte für das thermodynamische 
Gleichgewicht einsetzen?) 


3p? P à 
_ -+> — pV h? 82 02 V h? 33 oV\: h: 8? 20° V 
Pap=e ime 1- Fes Sim (32) + Fame? veto) @ 


Ebenso können wir fiir den inneren Teil des Gebietes, in dem in 
Fig. 1 P(x, p)=0 ist, auch jetzt P(x, p)=0 annehmen. Nur fiir das 
Trennungsgebiet des schraffierten Teiles vom unschraffierten müssen 

p wir eine etwas genauere Überlegung 
anstellen, die Bahnen werden dort. 
etwa so verlaufen, wie es in Fig. 2 
angedeutet ist. Dies wird durch die 
——= 72777: höheren Glieder von (2) bewirkt, die 
et den in der klassischen Wahrscheinlich- 
7 keitsfunktion vorhandenen Sprung 
auszuglätten bestrebt sind. 
ne Wir wollen zunächst annehmen, 
ei, dass die Verteilung für das thermo- 
a. dynamische Gleichgewicht (4) inner- 
halb des ganzen schraffierten Teiles 
Fig. 2. von Fig. 1 gilt, also insbesondere für 
p= 0 bis x=0, und dass die Wahr- 
scheinlichkeitsfunktion ausserhalb des schraffierten Bereichs wirklich 
Null ist. Berechnen wir den Strom unter dieser Voraussetzung mit 
Hilfe von (3), so haben wir nur noch in der Umgebung von x =0, 
»=0 die Gleichung (2) streng zu lösen und in dem Teil von (3), in 
dem wir die falschen Werte für P(x, p) benutzt haben, diese durch die 
richtigen Werte zu ersetzen. 


-” 


3. Setzen wir (4) für x< 0 in (3) ein, setzen wir dagegen für x >0 
das P (x, 0) =0, so erhalten wir bei Vernachlässigung von Gliedern mit 
höherer als der zweiten Potenz von h für r50 


e p Væl h?3 02 V(x) 23° (OV (x) \2 > 
fE Pepa =e e -a a t) © 
0 


1) Die Normierung ist dabei so, dass an den Stellen vollen thermodynamischen 


Gleichgewichts V m Atome pro Zentimeter sind, oder, was dasselbe bedeutet, 
í 


bei z = — œ sind VE nach rechts laufende Atome. 
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und für 2=0 


l h? 3 x 
[2 Panar = [#0 Der 6a) 
0 
wo = SL A wie im folgenden allgemein V,= 7 L ) sein soll. 


Bereits (5a) gibt einen guten Näherungswert für die Durchlässigkeit 
der Potentialschwelle gegenüber einem Atomstrom von der Tempe- 


1 
ratur T 


j 


Schreiben wir nämlich (2) für z~ 0 und eine symmetrische Schwelle 


p òP œ 'h\2n 1 grtip : 
wet Sle) wean Pea? 0 


und behalten wir fiir den Augenblick nur zwei Glieder der Summe. 
Dann können wir durch eine Substitution x = ay, p=a’g alle numeri- 
schen Koeffizienten gleich machen, so dass (6) in 


oP oP òP 
hey Yoa Sop Y (6a) 


auf 


übergeht. Dabei ist a = Ve; und dies wird daher auch die unge- 
fähre Grösse des Übergangsgebietes der Fig. 2 sen, in dem P von ef" 
auf Null herabfallt. In einem Gebiet dieser Grössenordnung haben wir: 
daher in (3) ein falsches P verwendet. Der für den Strom hierdurch 
pr Te. Obwohl 


nun dies auch mit der zweiten Potenz von h geht, ist es doch ‘viel kleiner 
als das andere Korrektionsglied, das zs (5a) steht, das Verhaltnis der 


beiden ist von der Grössenordnung — 


verursachte Fehler ist von der Grössenordnung e 


yi 75 7 Das ist aber für alle 
praktisch in Frage kommenden Temperaturen sehr klein. 

4. Im folgenden sei das zuletzt besprochene Korrektionsglied ein- 
gehender betrachtet, d.h. sein genauer Koeffizient berechnet!). Wir 
beschränken uns dabei auf eine symmetrische Potentialschwelle. Es 
muss das Übergangsgebiet in P(x, p) in der Umgebung der Stelle 
x=0, p=0 genauer untersucht werden. 


Die Überlegun g am Schlusse des vorigen ae gibt uns die 


247, 

Lv. . Daher 
1) Da das Glied, dessen Koeffizient in diesem Punkt berechnet werden soll, 

in den praktisch wichtigen Fällen sehr klein ist, ist die Kenntnis von 4 für das 

‚spätere nicht nötig. 


Grössenordnung von S in diesem Bereich, sie ist = 
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ist in dem fraglichen Gebiet jedes Glied von (2) von gleicher Grössen- 
ordnung, und wir dürfen die Reihe (6) nicht abbrechen, wie wir das 
in der provisorischen Betrachtung in (6a) getan haben. Wir müssen 
eine Lösung von (6) finden, die ausserhalb des Übergangsgebietes 


(eines Gebietes von der Grösse « - Var. impr in der x-Richtung und 
mV 


der Grösse «= V= >i 2 in der p-Richtung) sich so verhält, dass es 
—B Vo 


mit wachsendem Abstand vom Nullpunkt zu e (eigentlich zu 
e` "Yo (1+h2...), aber die Glieder mit A? können wir in diesem Korrek- 
tionsglied vernachlässigen) geht, wenn wir uns auf einer Geraden vom 
Nullpunkt wegbewegen, deren Richtung in das schraffierte Gebiet 
der Fig. 1 weist, die also einen Winkel zwischen » und p+ x mit der 
x-Achse einschliesst, wo tg y = V — mV, ist. Entfernt man sich dagegen 
auf einer Geraden vom Nullpunkt, die einen Winkel zwischen p— x 
und p mit der x-Achse einschliesst, so soll die Lösung zu Null gehen. 

An Stelle dieser Lösung können wir auch eine Lösung betrachten, 
die im ersten Winkelbereich zu ze? Yo, im zweiten zu 7 e-PPo geht. 
Wenn wir zu dieser Lösung noch die triviale Lösung e”?" von (6) 
addieren, erhalten wir die Lösung, die die vorerwähnten Grenzbedin- 
gungen befriedigt. 

Nun ist (6) linear sowohl in x, wie auch in p. Durch eine LAPLACE- 
Transformation in beiden Argumenten 


P(x, p) = ((Qy, de -!«v-rrdydg (7) 


können wir sie daher zu einer partiellen Differentialgleichung ersten 
Grades machen (die in sehr naher Beziehung zur gewöhnlichen SCHRÖ- 
DINGER-Gleichung steht) 
y 2Q_ 2 Sr Vente (ts dQ -0| 
m òq M EED dy 
m Ò F (2 dy | 
Die Reihe der ersten Zeile ist ja die TayLor-Reihe für die Ab- 


leitung von V an der Stelle Die allgemeine Lösung dieser Gleichung 


(8) 


nn Qu.9) = FW - Wig) (9 
wo f eine beliebige Funktion ~ und zur Abkürzung 
wo = 52 (v-r) (9a) 


gesetzt ist. 
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Um die Grenzbedingungen zu befriedigen verfahren wir folgender- 
massen!): Zunächst bleibt (6) invariant, wenn man darin x durch —x 
und p durch — p ersetzt. Mit (7) ist daher auch O getzy-PVdydq 
oder auch os 

Piz, p) =f J QU) sin @y—pg)dydg (7a) 
0- o 
eine Lösung; (7a) hat gleichzeitig die erwünschte Eigenschaft an der 
Stelle —x, —p entgegengesetzt gleich wie an der Stelle x, p zu sein. 
Nunmehr setżen wir versuchsweise 


Uy, g) = C- ôy — Wig) (10) 
mit der Drracschen 6-Funktion. Es ergibt sich nach einer einfachen 
Umformung, indem man für y? =z einsetzt und die Integration über z 
ausführt o = 
VW.e- 80 f sin (pg — xV W(q)) d 

ave J vwo 


Für die Konstante C wurde dabei ein bestimmter Wert genommen, 
um die Grenzbedingung zu erfüllen. Es bedeutet wiederum 
ò' W (0 
Ws I ), 
Man überzeugt sich leicht, dass (11) die Differentialgleichung (6) 
befriedigt, das folgt auch aus seiner Herleitung. Da indessen (11) nicht 
absolut konvergiert, ersetzen wir es durch 


Ve 07: fü es ey FEN ag 
EDS a VWa) — = 


und lassen nachträglich a gegen Null gehen. Im folgenden wollen wir 
mit (lla) rechnen. Um noch die Randbedingungen zu verifizieren, 
nehmen wir an, dass x sehr gross ist. Dann wird das Integrations- 
gebiet mit endlich grossem gq nichts zum Integral beitragen, weil der 
Integrand in diesem Gebiet sinusförmige sehr rasche Schwankungen 
ausführt. Nur das Gebiet um q=0 wird einen endlichen Beitrag 
liefern, weil der Nenner = q=0 verschwindet. In diesem Gebiet 


P(x, p) = (11) 


können wir W (q) durch = 5 Wag? ersetzen und dann bis oo integrieren, 
da das Integral doch ae konvergent ist. Nach bekannten Formeln 
wird dann für z = œ, p = œ 


1) Für seine freundliche Hilfe bei der Bestimmung der richtigen Lösung von (6) 
sei Herrn J. v. NEUMANN auch an dieser Stelle bestens gedankt. 
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a 5 | 
P(x,dx) = Eee f= evr sin ((p — zV W/2)q) | 
y2 qV W,/2 


[gezen fir = >VW,/2=V—mP, 
I 4e- en für 2 < VW,/2=V—mV, 


wie wir es gewünscht haben. Die gesamte Wahrscheinlichkeitsfunk- 
tion E der Nähe von x=0, p=0 erhalten wir, indem wir zu (lla) 


noch 5 Lae *¥o addieren. 


een müssen wir mit Hilfe der eben gewonnenen Wahrscheinlich- 
keitsfunktion die Zunahme des Stromes in der Nähe des Nullpunktes 
nach (3) berechnen und daraus die bisher fälschlicherweise ange- 
nommene Zunahme (I,xe"?” bis x=0, von dort ab Null; wir sind 
ja immer in der Nähe des Nullpunktes, und Korrektionsglieder können 
wir in diesem Korrektionsglied vernachlässigen) abziehen. Zunächst 
haben wir für (3) 


Ss: 1 etl (x) 0°" Pl, 0) 
a ( 


2n 


2n +1)! Aaıri 


zu berechnen. Es ergibt sich dafür für kleine x, wenn wir für P(x. p) 
(lla) benutzen 


VWa -pn V hyn Von ge sin (— x YW(q)) ah Rt Sha, 
ee ern SD Garner ran m 
_ _V2WM -n wa) sinz VWa) a 
a5 efri 1VW@ ae 


Hierzu ware noch wegen des zu (lla) hinzukommenden Gliedes 
1 


- 4: e 1 - . RT 
a eto die Grösse > V ve~’? To zu addieren und dann, um die bisher 
fälschlicherweise angenommene Zunahme wieder abzuziehen, bis x = 0 
der Betrag W,xe””"° zu subtrahieren. Im ganzen bleibt zu (12) 


Eis : 
z sign v.g V,e—F% 


zu addieren, wo sign x= 4+ 1 für z>0 und =—1 fiir <0 ist. Wir 
können für dieses Glied auch 
e- mE f Smeay Ty? dy (12a) 


0 
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schreiben. Zum Schluss wäre die Summe von (12) und (12a) in der 
Umgebung von x= 0 über x zu integrieren, um das restliche Korrek- 
tionsglied für (5a) zu erhalten. Die Integration über x kann man 
aber auch von —oo bis oo erstrecken, da der Integrand nach beiden 
Seiten vom Nullpunkt sehr stark abfällt. Da der Integrand eine gerade 
Funktion von z ist, kann man dies auch durch das doppelte von 0 bis oo 
genommene Integral ersetzen. Um weiterhin die Integrationsreihen- 
folge umkehren und zuerst über x integrieren zu können, versehen wir 
den Integranden mit einem Faktor e—°*, wobei wir später b zu Null 
machen werden. Nach Ausführung der Integration über x erhalten wir 


W'(q)b W,b 


rat: op - BV NEUN u hen E a, 
a e 7 (eae Bw) dq. 


Dies muss noch nach g von Null bis oo integriert werden. Da b 
am Schlusse zu Null gehen soll, ist es klar, das nur das Gebiet um 
q = 0 etwas zum Integral beitragen wird — fiir endliche g und kleine 6 
verschwindet der Integrand. Man tiberzeugt sich leicht, dass es 
genügt im ersten Glied W (q) und W’(g) bis zum zweiten Glied in eine 
Reihe zu entwickeln, das nächste Glied würde im Resultat schon mit 
einem Faktor b behaftet sein. Dadurch geht (13) in das Integral einer | 
rationalen gebrochenen Funktion über, das man entweder elementar 
oder auf komplexem Wege auswerten kann. Man erhält so 


(13) 


Wie Vo Ve fn (14) 
24m W, 96 mV, 
als letztes Korrektionsglied fiir den Strom durch die Schwelle, so dass 
sich dieser nach (5a) zu 
2 2 
J Z P(co,p)dp = er 5 Aue (14 a) 
0 
ergibt. 

Wir haben schon gesehen, dass (14) in den meisten Fällen, die 
praktisch von Interesse sind, sehr klein gegen das erste Korrektions- 
glied ist. Es ist aber sehr störend, zu sehen, dass es über alle Grenzen 
steigt, wenn V, zu Null geht. Es ist nicht leicht zu sehen, ob diesem 
Verhalten von (14) ein tatsächlicher Effekt zugrunde liegt, oder ob 
es nur im verwendeten Näherungsverfahren seine Begründung hat. 
Es ist ja klar, dass zwischen den verschiedenen Ableitungen von V 
Beziehungen bestehen müssen, wenn es die Form einer einfachen 
Schwelle haben soll. Andererseits ist aber auch unsere Rechnung wegen 
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Konvergenzschwierigkeiten mathematisch keineswegs streng, so z.B. 
divergiert (11) für p=0. 

5. In der klassischen statistischen Mechanik kann man die Wahr- 
scheinlichkeit der Überschreitung einer Schwelle sehr einfach folgender- 
massen berechnen: Man betrachtet den Fall vollständigen thermo- 
dynamischen Gleichgewichts. Dann ist die Anzahl der Atome mit dem 
Impuls p auf der Schwelle = e”f"re-?r*m und folglich die Gesamt- 
zahl der Atome, die die Schwelle pro Zeiteinheit nach der einen Rich- 
tung hin passieren 


oO 
- 


IE - 8 -Ez a ~ 87; 
me e dp = aoe (15) 
0 

Nun kann man annehmen, dass dies auch die Zahl der Atome 
ist, die die Schwelle dann passieren, wenn diese von einem Strom von 
Atomen im Temperaturgleichgewicht nur von der einen Seite 
getroffen wird. Man sieht ja, dass im Falle einer einfachen Schwelle 
alle Atome, die die Schwelle nach rechts passieren, von der linken 
Seite stammen, und ihre Zahl wird gleich sein, unabhängig davon, ob 
von der rechten Seite Atome kommen oder nicht. 

Diese Überlegung kann man in dieser Form in die Quanten- 
mechanik nicht übertragen, weil man überhaupt nicht von der Wahr- 
scheinlichkeit von Atomen reden darf, die an einer bestimmten Stelle 
sind und dabei eine vorgeschriebene Geschwindigkeit haben. Doch 
kann man immerhin versuchen, die der klassischen entsprechende 
Grösse mit Hilfe der Wahrscheinlichkeitsfunktion (4) zu berechnen. 
Man erhält so 


œ 


pp? 

fz e-ve 2m (1 Ey V,p*\dp = e- êV (5 - 7 V) (15a) 
0 

also tatsächlich das richtige erste und damit wichtigste Korrektions- 

glied von (14a). 

Dies erlaubt es einem, zu erraten, was die Wahrscheinlichkeit 
der Schwellenüberschreitung im mehrdimensionalen Falle ist. An sich 
könnte man die Rechnung genau so wie im eindimensionalen Fall aus- 
führen, nur erscheint sie noch etwas mühevoller. Es sei auch be- 
merkt, dass die in 3. verwendete vereinfachte Überlegung genau zu 
unserem Resultat (17) führt. 

Im Falle mehrerer Dimensionen handelt es sich nicht so sehr um 
die Überschreitung einer Schwelle, als um die eines Grates, wie sie 
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aus der Theorie der einfachsten chemischen Reaktionen hinlänglich 
bekannt ist. Es spielen dabei zwei Effekte eine Rolle, die sich gegen- 
seitig zu kompensieren versuchen: der bisher betrachtete Tunneleffekt, 
der die Wahrscheinlichkeit der Gratiiberschreitung erhöht, und die 
Nullpunktsenergie, die eine grössere Energie für die Überschreitung 
eines engen Passes erheischt, als dem untersten Punkte des Passes 
entsprechen würde. Wir legen das Koordinatensystem so, dass das 
Potential in der Nähe des Passes die Gestalt 


Ve, +++ a) = V, +5 (Aa? + Apr? H + Ane) (16) 


habe. Dabei ist A, negativ, alle anderen Koeffizienten positiv. Die 


Wahrscheinlichkeitsfunktion lautet längs des Grates (x, =0) 
EL A Pe ERROR RER IE 2 
Play tn Pa Bq) = e BME TA o- ay, gg Matte Haah 


Í ELAT. | WPB ige 2 bad A2 x? 
u Sm | 1t + n) + 5g a ( 2% F + Ay Xn) (16 a) 
| h2 83 
+ 3g m2 (pi A, + en + p? An)) 
und mit A multipliziert und über %, 73, ..., x, und alle Momente 


integriert, ergibt dies für den Strom über den Grat 
Amın +1)q_qn-1 e-?Po í h? 8? 


AE T A 5, At + 4p). (17) 


Es muss jedoch bemerkt werden, dass hierbei Zusatzglieder, wie 
(14) konsequent weggelassen worden sind. Diese treten hier nämlich 
nicht nur aus demselben Grunde wie im eindimensionalen Fall auf, 
sondern eigentlich schon bei der Integration von (16a), wenn man für V 
auch höhere Glieder als in (16) angegeben mitbenutzt. Dies ist übrigens 
schon im klassischen Teil der Formel der Fall. Ausserdem muss be- 
merkt werden, dass die ganze Überlegung, die zu (15) führt, im mehr- 
dimensionalen Fall auch in der klassischen Statistik nicht mehr ohne 
weiteres berechtigt ist, weil es nicht mehr immer wahr ist, dass die 
Atome, die den Grat von links überschreiten, von links stammen. Sie 
stammen, wenn das Potentialbild etwas kompliziert ist, zum Teil von 
rechts, gingen dann bereits nach links hinüber und passieren die 
Schwelle nun wieder nach rechts. Doch spielt dieser Effekt wohl in 
den meisten Fällen eine nur untergeordnete Rolle. 

6. Es sei nunmehr (l4a) auf die EckAartsche Potentialschwelle 
angewandt und mit dem Ecxkartschen Resultat verglichen. Der 
symmetrische Teil des Eckartschen Potentials lautet 
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4Vo 
(1 + e7/)(1 + e72) | 2) 
und seine Durchlässigkeit für eine Welle mit der Energie E berechnet 


ECKART zu Ls 
E 
Cosh m Y -- —1 
C 
l— 0 = 


Cosh a Vox -}- Coshz V2 


ist. Die Durchlässigkeit einem Atomstrom in Tempe- 


(19) 


h? 
32 mb? 
raturverteilung gegenüber ist daher 


wo C= 


fa —o)e-PEdE w er + 


0 


(20) 


h? 37, h? ) 
48 mb? | 48mb? 


Die rechte Seite ergibt sich durch Integration, ist aber nur bis 
auf Glieder mit h? richtig. Formel (14a) ergibt 
(l1 h23V, =, 
rl tan Fan) 
also ein mit (20) identisches Resultat. Der Vergleich unserer Formel 
in diesem Spezialfall mit der Eckartschen mag wegen des bereits 
betonten Umstandes nicht überflüssig erscheinen, dass die Ableitung 
von (14a) mathematisch nicht streng ist. 

7. Zum Schluss sei die Quantenkorrektion für die Umwandlungs- 
geschwindigkeit des Parawasserstoffs in normalen Wasserstoff be- 
rechnet. Da die Übereinstimmung zwischen den experimentellen 
Werten!) und den ohne Quantenkorrektion berechneten?) bereits sehr 
gut ist, kann es sich nur darum handeln, nachzusehen, ob die Über- 
einstimmung durch die Quantenkorrektion nicht zerstört wird, wie 
dies zunächst vermutet werden könnte ?). 

In der Tat zeigt es sich, dass die Quantenkorrektion sehr beträcht- 
liche Werte hat, und man kann gar nichtschematisch nach (17) rechnen, 
weil die Näherung mit der zweiten Potenz von h nicht ausreicht. 
Glücklicherweise können die Glieder in zwei Gruppen eingeteilt werden: 
solche, bei denen man mit (17) sehr wohl auskommt und solche, bei 
denen man noch ganz im Quantengebiet ist, so dass man die Existenz 
der höheren Schwingungen ganz vernachlässigen kann. Da die Rech- 
nung etwas langwierig ist, sei sie hier nur skizzenhaft dargestellt. 


(20 a) 


1) A. Farkas, Z. physikal. Ch. (B) 10, 419. 1930. H. GEIB und P. HaRTEck. 
Z. physikal. Ch., BoDENSTEIN-Festband, 849. 1931. 2) H. PELZER und E. WIGNER. 
Z. physikal. Ch. (B) 15, 445. 1932. 
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Die Rechnung wurde (loc. cit.) auch mit Hilfe der Wahrschein- 
lichkeitsfunktion ausgeführt, es wurde aber die klassische Wahrschein- 
lichkeitsfunktion verwandt. Dies erleichtert das Anbringen der 
Quantenkorrektionen, da die entsprechenden Ausdrücke immer nur 
mit einem Faktor (1+ A?...) multipliziert bzw. dividiert werden 
müssen. | 


Die erste Korrektion betrifft die Normierung der Wahrscheinlich- 
keitsfunktion. An Stelle-des Integrals über die klassische Wahrschein- 
lichkeitsfunktion des A, sollte das über die korrigierte Wahrschein- 
lichkeitsfunktion treten, die sich von ersterem durch den Faktor 

h2 2. 

1— a w? (21) 
unterscheidet, wo w die 2rfache Frequenz des H,-Molekiils ist. Nun 
zeigt ein Vergleich der Grössenordnungen der beiden Glieder in (21), 
dass in diesem Fall die Näherung mit h? zweifellos noch nicht hin- 
reichend ist. In diesem Fall ist es aber leicht, für (21) die genaue 


Zustandssumme e-Phoj2 
| Bho 


1—e-fhu (21a) 
zu berechnen. Der Faktor e~°*"? trägt nur zur Aktivierungswärme 
etwas bei, für den Rest kann man bei allen in Betracht kommenden 


Temperaturen = schreiben. In die Formel fiir die Reaktions- 


geschwindigkeit geht das Reziproke hiervon ein. 
Die Korrektion für die eigentliche Gratüberschreitung können wir 
durch den Faktor [vgl. (17)] 
2 32 2 92 2 42 2 92 

1-54.) 1-54) -a)i 54 (22) 
berücksichtigen. Dabei berechnen sich die A zu — 0'44 :105; 2°1-105; 
0°89 -105; 0°89 -10°%. Bei-dem ersten Faktor können wir die Form (22) 
beibehalten, die anderen drei beziehen sich auf die Zustandswahrschein- 
lichkeit auf dem Grat (nicht auf die Uberschreitung), bei ihnen ist 
die in (22) auftretende Näherung nicht zulässig, sie werden analog 
zu (21) durch 

2700 e-1350/7T 1100 e552 1100 e-550/T 


MH Eee I ee ne Te a en ET E un en 2 
T ]-e-?wT T ]_-e-10T T ]_-e-1100/7T (22a) 


ersetzt. Der erste Faktor kann wiederum für alle in Betracht kommen- 


700 A 
den Temperaturen Tr geschrieben werden. 
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Die numerische Berechnung ergibt folgende Tabelle: 


T | ks | Cait Cheu 
283 | 86-104 | 2-108 .| 2.106 
33 | 27.106 | 25.108 3-106 
873 | 12.109 2.10% 3.106 
1023 , 19.109 | 15.108 2.106 


In der ersten Spalte stehen die Temperaturen, in der zweiten 
die von GEIB und HArTEcK bzw. von A. FARKAS beobachteten Ge- 
schwindigkeitskonstanten in Liter/Mol in der dritten Spalte die von 


3 

PELZER und WIGNER aus diesen berechnete Grösse C= k T` ?e "RT 
(mit W, = 6600 cal) in der letzten schliesslich diese Zahlen dividiert 
noch mit der Quantenkorrektion, und mit W,= 5600 anstatt mit 
W, = 6600 cal berechnet!). Der theoretische Wert?) für die letzte 
Spalte wäre 10 -10®. Wir sehen, dass durch die Quantenkorrektion die 
Übereinstimmung zwischen den aus den Versuchen berechneten C und 
seinem theoretischen Wert etwas verschlechtert wurde. Sie war aber, 
wie dies schon (loc. cit.) betont wurde, vor Anbringung der Quanten- 
korrektion sehr weitgehend zufällig, während das jetzt nicht mehr 
wesentlich der Fall sein dürfte. 


Zum Schluss möchte ich es nicht versäumen, Herrn H. PELZER für 
seine liebenswürdige Hilfe zu danken, mit der er mich bei der Aus- 
führung dieser Arbeit unterstützt hat. 


1) Wegen der von (2la) und (22a) weggelassenen Faktoren berechnet sich hieraus 
die Höhe des Grates über dem tiefsten Punkt des Tales zu 9400 cal. 2) Der 
theoretische Wert war von PELZER und WIGNER wegen eines Fehlers in ihrer 
Formel (22) zweifach zu hoch angegeben worden. 
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The probability of a configuration is given in classical theory by the Boltzmann 
formula exp [— V/hT] where V is the potential energy of this configuration. For high 
temperatures this of course also holds in quantum theory. For lower temperatures, 
however, a correction term has to be introduced, which can be developed into a power 
series of h. The formula is developed for this correction by means of a probability func- 
tion and the result discussed. 


1 


N classical statistical mechanics the relative probability for the range 


statistical equilibrium by the Gibbs-Boltzmann formula 
P(x1, ++ +5 Sup Py 0+ y Pn)d%1 ++ -danddı ++ «dpa = ePeday + + -dændpi >» -dpn (1) 
where e is the sum of the kinetic and potential energy V 


2 2 | x | 
tt + Va m) (2) 


2m, ma Mn 
and ß is the reciprocal temperature T divided by the Boltzmann constant 
B = 1/kT. (3) 


In quantum theory there does not exist any similar simple expression for 
the probability, because one cannot ask for the simultaneous probability for 
the coordinates and momenta. Moreover, it is not possible to derive a simple 
expression even for the relative probabilities of the coordinates alone—as is 
given in classical theory by e"?Y@'"* =». One sees this by considering that this 
expression would give at once the square of the wave function of the lowest 
state | Yoli st + Xn) |? when 8 = œ is inserted and on the other hand we know 
that it is not possible, in general, to derive a closed formula for the latter. 

The thermodynamics of quantum mechanical systems is in principle, 
however, given by a formula of Neumann,! who has shown that the mean 
value of any physical quantity is, (apart from: a normalizing constant de- 
pending only on temperature), the sum of the diagonal elements of the matrix 


Qen o A 


where Q is the matrix (operator) of the quantity under consideration and H 
is the Hamiltonian of the system. As the diagonal sum is an invariant under 


1 J. von Neumann, Gött. Nachr. p. 273, 1927. 
| 749 
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transformations, one can choose any matrix or operator-representation for 
the Q and H. In building the exponential of H one must, of course, take into 
account the non-commutability of the different parts of H. 


2 


It does not seem to be easy to make explicit calculations with the form 
(4) of the mean value. One may resort therefore to the following method. 

If a wave function (x - : - x") is given one may build the following 
expression? 


P(x, £ni Putea Prd 
Iye ps 

-7 J. oe f dyii dyaplar + yon tm" 
hr -0 


p(x — yı E ynyert(Pivrt: ++ Pnun)/h (5) 
and call it the probability-function of the simultaneous values of x1: > > Xn 
for the coordinates and $i - - - pn for the momenta. In (5), as throughout 


this paper, h is the Planck constant divided by 27 and the integration with 
respect to the y has to be carried out from — œ to ©. Expression (5) is 
real, but not everywhere positive. It has the property, that it gives, when 
integrated with respect to the.p, the correct probabilities lyx - + + Xn) |2 
for the different values of the coordinates and also it gives, when integrated 
with respect to the x, the correct quantum mechanical probabilities 


& 
Í dieni fue: re x.)e Pitit: +HPntn)l Ady Pare dx, 
00 i 


}2 


for the momenta pı, : * > , pn. The first fact follows simply from the theorem 
about the Fourier integral and one gets the second by introducing xı+Yx« 
= ur, Xk — Yk =V into (5). 

Hence it follows, furthermore, that one may get the correct expectation 
values of any function of the coordinates or the momenta for the state y by 
the normal probability calculation with (5). As expectation values are addi- 
tive this even holds for a sum of a function of the coordinates and a function 
of the momenta as, e.g., the energy H. In formulas, it is 


[SS Ser man abo po) + st a 


P(xı:-- Xn; pitte Dn) 


fo DE Ce ae ru 


+ glaı tn) a: -e e Xadi e dän 


for any y, f, g, if P is given by (5). 


2 This expression was found by L. Szilard and the present author some years ago for another 
purpose. 
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Of course P(xı, © © © , Xa; Du ° © © , Pn) cannot be really interpreted as the 
simultaneous probability for coordinates and momenta, as is clear from the 
fact, that it may take negative values. But of course this must not hinder the 
use of it in calculations as an auxiliary function which obeys many relations 
we would expect from such a probability. It should be noted, furthermore, 
that (5) is not the only bilinear expression in %, which satisfies (6). There 
must be a great freedom in the expression (5), as it makes from a function 
y of n variables one with 2n variables. It may be shown, however, that there 
does not exist any expression P(x - - - Xa; Pı ©- -< $n) which is bilinear in 
y, satisfies (6) and is everywhere (for all values of x1, © © © , Xn, Pi, ' * * » Da) 
positive, so (5) was chosen from all possible expressions, because it seems to 
be the simplest. 


If y(x1, - © - , Xn) changes —. to the second Schrödinger equation 
oy u ory 
ih— = — — + V(%2,°°°, Xn 7 
Ey > = 2m, 3x ( 2; , )v ( ) 
the change of P(xı, © © © , Xn; Du °°: , Pn) is given by 
ðP n pr ðP Baty (Hit Hn temp 
— = DE + YS 
ðt k=l My OX: OL e Ox Mlee An! Op ++ OD,” 
where the last summation has to be extended over all positive integer values 
of \i, © * +, An for which the sum M+M‘ + -+M is odd. In fact we get for 
dP/dt by i and (7) 
oP = f fan: © © dYpe?? lP Wt +Payn) Ih : 
a = 
th 07 (x1 + Vip °° ty Xn + In)" 
4 Fale e ' 6+ fa In) 
k 2m Ox,” 
plti = V1, +++) tn — Vn) 
+ Ylaı + y °° at 9) ———— (9) 
Ox," 


+ — [Vai + yi, +++ Xa Vn) 
— V(x. — Yu °° +5 En — Yn) Wer + Yi, tn bn) W(t = Yu aln yn) 


Here one can replace the differentiations with respect to x, by differentiations 
with respect to yx and perform in the first two terms one partial integration 
with respect to yx. In the last term we can develop V(xı+ Yu ©, Xa +n) 
and re Yi, ** *, Xn—Yn) in a Taylor series with respect to the y and get 


. e? Pt e+Pnyn) lh 
a - fiat fi ee 


4 zA _ O(a + r `, En + Yn)* 
Yk 


Yaı = Yi, En — Yn) 
k Mk 
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IYlzı — °t y En — Yn 
+ plx + Yy, m + ee] 
OV: 


1 Otty yp nee ce Vn 
fe i egy ee EGOS 
h 2 Oxy" SE Ox," X! NE = 1 Yis ’ + y ) 


in 3 


which is identical with (8) if one replaces now the differentiations with respect 
to yz by differentiations with respect to xx. Of course, (8) is legitimate only if 
it is possible to develop the potential energy V in a Taylor series. 

Eq. (8) shows the close analogy between the probability function of the 
classical mechanics and our P: indeed the equation of continuity 


ôP pr OP OV ðP 
at k Mk OX, k OX, OPE 


differs from (8) only in terms of at least the second power of and at least the 
third derivative of V. Expression (8) is even identical with the classical when 
V has no third and higher derivatives as, e.g., in a system of oscillators. 

There is an alternative form for 0P/dt, which however will not be used 
later on. It is 


ð ð 
E Plan mpo p= DS P(x, ° + *, Xn; pip? ++, Pa) 
ði k OX, 


. | | (11) 
+f P fä dj Plau m Pa H int, Pa H GaTe An; 
Jig aga): 
where J (xi, ° °°, Xni Jis °° * Ja) can be interpreted as the probability of a 
jump in the momenta with the amounts jı, * * * , Ja for the configuration 
X1, °° * , Xn. The probability of this jump is given by 
EEE ae EEE D) 

-f fin -dYa LV (x1 + Yi, En F Ya) 
— Vlxı — Yt, Xn — Yn) Je HM niit +unin) (11a) 

that is, by the Fourier expansion coefficients of the potential V(x, © © * , Xa). 


This form clearly shows the quantum mechanical nature of our P: the mo- 
menta change discontinuously by amounts which would be half the momenta 
of light quanta if the potential were composed of light. To derive (11) one 
can insert both for P and J their respective values (5) and (11a) on the right 
hand side of (11). In the first term one can replace p,e?*(7imt--: tpnundih by 


a Cf. F. Bloch, Zeits. f. Physik 52, 555 (1929). 
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(1/24) (0 /dy rje P +++ +Pava/A and then perform a partial integration with 
respect to yz. Then one can replace the differentiation with respect to y by 
differentiation with respect to x, upon which some terms cancel and the rest 
goes over to 


h ð +y, = a 
paua f fina p Hert Yy ° Yn)* a eee 
k 21m | | 
ðY (z — Yis 


aes MEZ + Yu’: > Xn + Yn) er 20) nt (12) 


Ox,” 
which is just what we need for the left side of (11). By integrating the second 
term on the right side of (11) 


ei ies En — Yn) 


T Jf dji: - djn e(2% h) [(pyts) Yıt- 2 *+(pntin) yn] 


ae mf PM ` dza [V (x1 + 21° En + Zn) 
— Va — 21+ + + Xn 2) jei itte, ids 


with respect to z and j one gets because of the Fourier theorem? 


(i/h) f +2 | dyi: - dynplxi + Yyı° £n F Ya) *W(41 — Yi- En In) 
elil pint -eH Paid Ih. [V (x1 + yı -e Xr + Yn) a V(x ee yı er Xp — Yn) | (12a) 
and this gives the second part of the left side of (11). | 
; , 


So far we have defined only a probability function for pure states, which 
gives us the correct expectation values for quantities f(pı''* Pa) + 


g(x,---%x,). If, however, we have a mixture,‘ e.g., the pure states yı, Ya, 
Yz, : : : with the respective probabilities wı, we, ws, + - (with w +w: +w: + 
- =1) the normal probability calculation suggests a probability function 
PS, tts Ens du"; Pn) = > ur Pıaı, oe ty nytt pn) (13) 

A 


where P, is the probability function for ya. This probability function gives 
obviously the correct expectation values for all quantities, for which (5) gives 
correct expectation values and therefore will be adopted. 

For a system in statistical equilibrium at the temperature T=1/kß the 
relative probability of a stationary state y is e~¥s where Ey is the energy of 
Yı. Therefore the probability function is a part from a constant 

3 Cf. e. g., R. Courant und D. Hilbert, Methoden der mathematischen Physik I. Berlin 


1924. p. 62, Eq. (29). 
4 J. v. Neumann, Gött Nachr. 245, 1927. L. Landau, Zeits. f. Physik 45, 430 (1927). 
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P(x: °° Xn; pi: 


Sf fay: - EY (%1 + Yi: Xn F Yn)* 


e FE (xy — yit Inz Daler PINTEN FREVA, (14) 


Now 


>; Wr (ur +++ Un)" f (Ex) Ya (a1 +++ Da) 
A 


is that matrix element of the operator f(H), (H is the energy operator) which 
is in the u, -- + un row and vı - - - v, column. Therefore (14) may be written 
as 


© P(xti +++ Xn; pice? Pn) 


o0 
== f are Í dyı San dyneiltzrtun dit +lCantyn) pn) /h[e-AE] oy oe tunity nn 
u] 


„e` iliy) Prt + (an— yn) Po] Ih, (15) 


so that we have under the integral sign the x: +y1--- &R&tYn; Kı-Yı ' 
Xn—Yn element of the matrix e~*4 transformed by the diagonal matrix 
etl Ptt: +Pazn)/h Instead of transforming e”?# we can transform H first and 
then take the exponential with the transformed expression. By transforming 
H we get the operator (the ? are numbers, not operators!) 


h? 8? 
H = ne >, — + V(x: . ++ ty) erento 


k 2m, Ox? 


which is equal to 


2 n» /ihp, ð H EN 
if = eae 16 
f ji 2 Mi OX: 2m. xg? ( ) 
where 
n pi? 
e = 2, + V(%1,+++ Xn). (17) 
k=l 2mk 


So we get for (15) 
P(x, ° ++ 5 Du, Pa) 


=f f dyes sayal E harpet aa (18) 


By calculating the mean value of a quantity Q=f(pi, < © © , Pa) +g(x1, © + +, Xn) 
by (18) one has to obtain the same result as by using the original expression 
(4) of Neumann. 

If we are dealing with a system, the behavior of which in statistical equilib- 
rium is nearly correctly given by the classical theory, we can expand (18) 
into a power of k and keep the first few terms only. The term with the zero 
power of h is %_,(—B)"er/r! Now e” is the operator of multiplication with ther 
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power of (17). Its x1+91, * > * , Xa+ Yn; Kı yn *** , Xa —Yn element is con- 
sequently 


€(t1 + 91, °° +) Xn Ia)’ölaı + Yi, Xı — y1) > + > O(n + Yn, Xn — Yn). 


As 6 (also 6’, ô”, - - - ) only depends on the difference of its two arguments, 
one can write 6 (—2y:) -- - 6(—2y,) for the last factors and perform the 
integration by introducing —2y,,-:-+, —2y, as new variables. The terms 
with the zero power of h, arising from the first part of (16) only, give thus 


(1/2") I Brelau +++, tn)?/r] = efe/2r (19) 


which is just the classical expression. 

The higher approximations of the probability function can be calculated 
in a very similar way. The terms of e-##, involving the first power of the - 
second part of H only, are 


5 oP Sey (= TOR er (20) 


r=(0 r! p=1 k Mk Ox%k 2mk OXE 


By replacing all operators by symbolic integral-kernels one gets for the 
lity, °° +, Xat Yn; X1— V1, °° o, Xn — Yn element of the operator (20) 


yo J elx: FIn te +! 


r r! pml 
i >| 6(— 2y1) - ++ 8’(— 2yr) ++ + 5(— yn) 
k Mk 
2 
= = ö(— 21) dl 2 yr) + d(— 230) Ka. — Vc y tn T ae 
k 
Now 
r pi eu’ — e” 
Yerrrie ul == yer = T? 
p= p=1 €. E€} — é 


so that the summation over p and r can be performed in (21). By introducing 
again new variables wi, ---, Wa for — 2y, - - +, —2y„ and performing the 
integration one has 


1 D = ð h? | 
ar; m. OW, 2m WR? 


ee (zite Zk — WEIS, + sfn) PRA ebel zk," ‚zrtwpl2," n) 


e(%1, °° Le — Wr, e, En) — Elan o He 30° +, Xn) 


where w.=0 must be inserted after differentiation. The first differential 
‘quotient vanishes at w,=0, as the expression to be differentiated is an even 
function of w;. The second part gives 
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ee h? B? 8%  B3/ de \? | 
EEE) o 
2* k Mk 8 9x 24\ð xk 
In principle it is possible to calculate in the same way the terms involving the 
higher powers of the second part of Ẹ also, the summation over r and the 
quantities corresponding to our p can always be performed in a very similar 
way. In practice, however, the computation becomes too laborious. Still it is 
clear, that if we develop our probability function for thermal equilibrium in 
a power series of h 


Phan: tt, nj pn sd) = e + lift + hhh (22) 


(we can omit the factor 1/2” before e—**, as we are dealing with relative prob- 
abilities anyway) all terms will be quite definite functions of the p, V and the 
different partial derivatives of the latter. Furthermore it is easy to see, that 
frk will not involve higher derivatives of V than the k-th nor higher powers of p 
than the k-th. These facts enable us to calculate the higher terms of (22) in a 
somewhat simpler way, than the direct expansion of (18) would be. 

The state (22) is certainly stationary, so that it would give identically 
ðP/ðt=0 when inserted into (8). By equating the coefficients of the different 
powers of k in ðP /ðt to zero one gets the following equations: 


pr def OV ðe“ 
ee en, (23, 0) 
k mM, Xk k OX, OP, 
ð oV ð 
ee Das (23, 1) 
k My Xk k Ox, OPE 
Dr Ofe OV ðf: ôV h? get 


k Mk Ô Xk k OXk ðpk k 0% 24 ðpr? 
03% h? eebe | 
k#ı Ôx Ôx 8 09401 = 


(23, 2) 


and so on. The first of these equations is an identity because of (17), as it 
must be; (23, a), (23, 2), - - - will determine fi, fa, - - - respectively. All Eqs. 
(23, a) are linear inhomogeneous partial differential equations for the un- 
known f. From one solution fa of (23, a) one obtains the general solution by 
adding to it the general solution F of the homogeneous part of (23, a), which 


is always 
Dr OF OV OF 


k mr 0% k Xk OPE | 
This equation in turn is the classical equation for the stationary character of 
the probability distribution F(x, ---, Xni Pu **-, Pn). It has in general 
only one solution which contains only a finite number of derivatives of V, 
namely 


EEE - X a + a 2) = FO. 


k 2M 


In fact, if it had other integrals, like 
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‚F(di ty Dn; V, dV /dx, dV /Oxe, rite ) (24) 


then all mechanical problems would have in addition to the energy-integral 
further integrals of the form (24) which, of course, is not true. 

One solution of (23, 1) is fi =0 and the most general we have to consider is 
therefore fı = F(e)..We have to take however F(e) =0 as fı has to vanish for 
a constant V. So we get fı=0, as we know it already from the direct expan- 
sion of (18). The same holds consequently for fs, fs, - - - , as the inhomogene- 
ous part of the equation for fs only contains fı, the inhomogeneous part of the 
equation for fs only fı and fs, and so on. 

For f: one easily gets 


path eee le een) Oe 


k 8m; Ox, 24m; \ ð Xk k,l 24mm, ÖxLkðXI 


as a solution of (23, 2) and it is also clear, that this is the solution we need. 
The first two terms of fa we have already directly computed (21), the third 
arıses from terms with the second power of the second part of H. Similarly 
fais for one degree of freedom (n = 1) 


64m2B-? ef, = Halq) [82V"2/72 — BV" /120] 
+ H2(q) [B°V2V""/18 — 2p2V 2/15 — BVIV"'/15 + pV’'"/15] 
+ Ho(q) [B4V"4/18 — 2283V"2V"/45 + 282V"2/5 + 862V'V"""/15 
= 4gv""/15] 
where A, is the r-th Hermitean polynomial and g=6!/*p/(2m)1/2, 
It does not seem to be easy to get a simple closed expression for fx, but it is 


quite possible to calculate all of them successively. A discussion of Eqs. (23) 
shows, that the g in 


Play, du) = A + kge + higit) (27) 


(26) 


are rational expressions in the derivatives of V only (do not contain V itself) 
and all terms of g, contain k differentiations and as functions of the p are 
polynomials of not higher than the k-th degree. The first term in (27) with 
the zero power of h is the only one, which occurs in classical theory. There is 
no term with the first power, so that if one can develop a property in a power 
series with respect to h, the deviation from the classical theory goes at least 
with the second power of h in thermal equilibrium. One familiar example for 
this is the inner energy of the oscillator, where the term with the first power of 
h vanishes just in consequence of the zero point energy. The second term 
can be interpreted as meaning that a quick variation of the probability func- 
tion with the coordinates is unlikely, as it would mean a quick variation, a 
short wave-length, in the wave functions. This however would have the con- 
sequence of a high kinetic energy. The quantum mechanical probability is 
therefore something like the integral of the classical expression e~®* over a 
finite range of coordinates of the magnitude ~h/f. where is the mean mo- 
mentum ~(k7Tm)!/*, The correction terms of (27) have, among other effects, 
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the consequence that the probability for a particle being in a narrow hole is 
smaller than would be in classical statistics. From now on we will keep only 
the first two terms of (27). | 
4 
From (25) one easily calculates the relative probabilities of the different 
configurations by integration with respect to the 2: 


Sf Sen ap m Pao ba) 
| h?g? 1 02V ke 1/aV\?2 
k Mk x 24 k Mk OX. 7 


Hence the mean potential energy is 
1 8V 
Í Ve dx Í 5 — ePYdx f Ve*V dx 
h?B? k Mk Ox,” 


ii 24 2 
Í ePV/dx ( J eV’ dx ) 


1 09V 

bs 

+ h?B k Mk Ox,” 
24 


f eP’dx 


where dx is written for dx, - - - dx„ and the higher power terms of h are 
omitted. Similarly the mean value of the kinetic energy is 


1 0°V 
= Í > — edx 
pr? n hs z mir Ox, 
2 28 CH 24 I en (30) 
k Mk f bY dx 


P= 


(29) 


(1 — BV)e*"dx 


This formula also is correct only within the second power of h; in order to 
derive it one has to perform again some partial integrations with respect to 
the x. Eqs. (28), (29), (30) have a strict quantum mechanical meaning and 
it should be possible to derive them also from (4). One sees that the kinetic 
energy is in all cases larger than the classical expression 3”kT. 


5 


One fact still needs to be mentioned. We assumed that the probability 
of a state with the energy E is given by e—8”. This is not true in general, since 
the Pauli principle forbids some states altogether. The corrections thus intro- 
duced by the Bose or Fermi statistics even give terms with the first power of 
h, so that it seems, that as long as one cannot take the Bose of Fermi statis- 
tics into account, Eq. (25) cannot be applied to an assembly of identical par- 
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ticles, as, e.g., a gas. There is reason to believe however, that because of the 
large radii of the atoms this is not true and the corrections due to Fermi and 
Bose statistics may be neglected for moderately low temperatures. 

The second virial coefficient was first calculated in quantum mechanics by 
F. London on the basis of his theory of inneratomic forces.’ He also pointed 
out that quantum effects should be taken into account at lower tempera- 
tures. Slater and Kirkwood’ gave a more exact expression for the inneratomic 
potential of He and Kirkwood and Keyes’ calculated on this basis the classi- 
cal part of the second virial coefficient of He. H. Margenau® and Kirkwood? 
performed the calculations for the quantum-correction. The present author 
also tried to calculate it by the method just outlined. He got results, which 
differ from those of Margenau and Kirkwood in some cases by more than 
100 percent.!® It does not seem however to be easy to compare these results 
with experiment, as the classical part of the second virial coefficient is at low 
temperatures so sensitive to small variations of the parameters occurring in 
the expression of the interatomic potential, that it changes by more than 
20 percent if the parameter in the exponential (2.43) is changed by 3 percent 
and it does not seem to be possible to determine the latter within this accu- 
racy. 


t F. London, Zeits. f. Physik 63, 245 (1930). 

6 J. C. Slater and J. G. Kirkwood, Phys. Rev. 37, 682 (1931). 

7 J. G. Kirkwood and F. G. Keyes, Phys. Rev. 38, 516 (1931). 

s H. Margenau, Proc. Nat. Acad. 18, 56, 230 (1932). Cf. also J. C. Slater, Phys. Rev. 38, 
237 (1931). 

* J. G. Kirkwood, Phys. Zeits. 33, 39 (1932). 

10 I am very much indebted to V. Rojansky for his kind assistance with these calcu- 
lations. The reason for the disagreement between our results and those of Margenau and Kirk- 
wood may be the fact that they did not apply any corrections for the continuous part of the 
spectrum. | 

In a paper which appeared recently in the Zeits. f. Physik (74, 295 (1932)) F. Bloch gets 
results which are somewhat similar to those of the present paper. (Note added at proof.) 


Contributions to the Theory of the Neutron* 
E. P. Wigner 


(Original in Hungarian with German abstract.) Adalékok a Neutron Elmélltéhez. 
, Magyar Tudományos Akadémia Matematikai és Természettudomanyi 
Ertesitöye 49, 142-146 (1932). Translated from Hungarian by Akos Sebestyén. 


1. Pauli [1], striving to explain the continuous spectrum of ĝ-particles, devel- 
oped a theory of the neutron. It is well known that various atoms of a radioac- 
tive element emit electrons of various energies, despite the fact that both the 
energies of the disintegrating atom and of the products of the decay are always 
the same. The later condition can be concluded from the fact that the energies 
of the a-particles preceding and following the -particle are constant. 

We can interpret this experimental fact in two ways, either the principle 
of conservation of energy has to be abandoned (Bohr) or we have to suppose 
that neutral particles called “neutrons” having a continuous energy-spectrum 
are emitted simultaneously with the @-particles so that the sum of the energy 
of the neutron and the electron emitted by an atom is constant. 

L. M. Mott-Smith and G.L. Locher [2] have pointed out the possibility that 
cosmic radiation consistes of neutrons. Sometime later Chadwick’s [3] has 
brought forward a number of arguments that the radiation emitted by Beryl- 
lium under the influence of a-radiation and detected by Curie-Joliot and Joliot 
consisted of neutrons. Chadwick’s! experiments represent the most definite 
proofs on the behalf of the neutron-theory at present. 

The mass of the neutrons observed by Chadwick is undoubtedly identical 
to the mass of the protons as was already pointed out by Chadwick himself. 
Though directly from the experiments it follows only that the magnitude of 
the mass of these particles and that of the proton is the same. However from 
the atomic weights being always integer multiples of the mass of the proton it 
follows that only very light particles and those of the mass of the proton can 
exist in nuclei. 

Chadwick’s neutrons, which we shall call heavy neutrons in what follows, 
are not suitable to explain continuous -radiation. In G-radiation the atomic 
weight remains constant whereas the loss of a heavy neutron would decrease the 
atomic weight by one. Moreover it would be possible to detect heavy neutrons 
emerging simultaneously with 6-radiation, as experiments show that their ion- 
ising capability is not so small that this would be impossible. Hence if we want 
to explain the continuous ß-radiation without violating the energy-principle we 


* Editor’s Note: At the time that Wigner wrote this article, the name neutrino had 
not been adopted for Pauli’s light neutral particle. The reader is warned that the 
name neutron is often used here for what we now call neutrino. 
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have to postulate another kind of neutron — which we shall call light neutron 
in what follows — and whose mass is so small that it does not manifest itself in 
the atomic weight. For reasons of symmetry it is the simplest to suppose that 
its mass is equal to the mass of the electron. 

Now the main question obviously is: Is there any reason to suppose that 
the ionising power of the light neutron is substantially smaller than that of 
the heavy neutron? Indeed it is possible to bring forward reasons for that. 
In general at big velocities the mean range of heavy particles is shorter than 
that of light particles; examples for this are the @ and a-radiations. Born has 
shown generally that the collision cross-section of neutrons with other bodies 
is proportional to the square of the relative mass (mim2/(mi + m2)) of the 
two colliding bodies. Bohr has pointed! out that this explains why the heavy 
neutrons collide mostly with nuclei; in this case the relative mass is about a 
thousand times larger than for collisions with electrons. The same rule shows 
also that the collision cross section of light neutrons is much smaller than the 
collision cross section of heavy neutrons. Namely it is more probable that the 
most eflective collisions will take place with electrons. In this case the relative 
mass is not significantly smaller than for collisions with nuclei (the ratio is 
about 0,5) however the energy loss is much larger because in collisions with 
heavy bodies the light particle can lose only a small part of its energy due to 
the law of conservation of momentum. 

Consequently the fact that demonstrations of rays of light neutrons ema- 
nating from (@-radiating bodies have been so far unsuccessful is not an argument 
against light neutron radiation. Since every -radiation is continuous we have 
to conclude that together with each electron a light neutron must leave he nu- 
cleus too, or in other words the number of electrons and light-neutrons must 
coincide in every nucleus. 

Supposing this we get rid of a further grave difficulty which seemed to make 
the quantum mechanical treatment of nuclei hopeless. What I am thinking of is 
that neither the statistics nor the proper angular momenta of nuclei correspond 
to the rules [4] that can be deduced for the statistics and angular momenta of 
compound bodies from the basic principles of quantum mechanics using sym- 
metry considerations only [5]. Experiments according to Heitler and Herzberg 
[6] show on the contrary that when particles in the nuclei are counted, electrons 
must be left out of consideration and only the atomic weight, or as it had to be 
understood then the number of protons, determine the integer or half integer 
properties of the statistics and angular momentum. If however we suppose that 
together with each electron a light neutron is also present in the nucleus then 
the total number of particles is even or odd according to even or odd number 
of protons and heavy neutrons as the number of light neutrons and electrons 
is always even. Hence the atomic weight, that is the sum of the numbers of 
protons and heavy neutrons determines the integer property of the statistics 
and the angular momentum. 


1 Thanks are due to Mr. F. Sauter for acquainting me with this hitherto unpublished 
work of Bohr. 
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Perhaps it is needless to stress that these considerations are hypothetical. 
The only concrete evidence for the neutrons is provided by the fact that the 
properties of some radiations very abundant in energy are not consistent with 
the picture we have constructed on the ground of the properties of the electron, 
proton and rays of light known at present. However these properties have been 
detected in the case of radiations less abundant in energy and the extrapolation 
to radiations that are very energetic as the cosmic radiation or the Curie-Joliot 
radiation not known very well yet is by no means safe. Naturally we know hardly 
anything about neutrons yet consequently we may assume their characteristics 
to correspond to those of these radiations. 

Note added in proof. Recently it became customery to suppose [7] that the 
neutron detected by Chadwick is not an elementary particle rather it consists 
of a proton and an electron tightly bound to it. This opinion is supported by 
Chadwick’s observation who found the mass of the neutron to be lighter than 
the proton by about a mass of four electrons, and it is not likely that the masses 
of two elementary particles differ by 2/1000. According to Chadwick the mass 
of four electrons corresponds to the binding energy of the neutron. 

If this conjecture holds its ground and the heavy neutron is not an elemen- 
tary particle then the existence of Pauli’s neutron an elementary particle with 
a mass of an electron becomes very unlikely. 

However, besides that in this case we have no explanation for the anoma- 
lies of spin and statistics and for the continuous ß-radiation, the stability of 
the hydrogen atom also contradicts this assumption. Namely it is possible to 
estimate the time needed for the transition of an H-atom from the state 2p to 
the lowest state, that is to a neutron, if such a state existed at all. This time 
is the reciprocal of the transition probability 


402,3 
64r*e"v r2, (1) 
3hc’ 


where v ~ 4mc?/h is the frequency of the radiation, r is the integral 


se l l J ztbo(xyz) (xyz) dx dy dz (2) 


Wo being the wave function of the electron in the neutron state and 7% is that 
in the 2p state. Since we only look for an estimate of (1) we may write 


do = 1 e 1/240 (3) 


\/ Brag 


where ao stands for the radius of the neutron and 7, is known from the theory 
of the H-atom. For the transition probability we get 


(4) 


5 
647428 m3c? (2) o rme? ia 
=m re 0n a) 
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where on the right hand side we have substituted the radius of the neutron 
by e?/2mc? and Sommerfeld’s constant a is ~ 1/137. Thus for the transition 
probability we obtain 1074 sec™!. 

Naturally the hydrogen atom is not in the state 2p normally and the tran- 
sition probability from the state 1s is zero. If however the H atom appears in 
a compound then the coefficient of the 2p state in its eigenfunction is not zero 
but is about 0,1 (for the probability of the 2p state this gives 0,01). This way 
the life of the H atom turns out to be about one month. 

Hence in the afore-mentioned theory of the neutron not only the theory 
of the hydrogen atom must be abandoned but it must be accepted that the 
formula for the transition probability does not yield a correct value even in 
magnitude without comprehensive reason. 
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BEITRÄGE ZUR THEORIE DES NEUTRONS. 


Von EUGEN WIGNER. 


Wenn man annimmt, dass die kürzlich von Cuapwıck be- 
obachteten Neutronen elementare Partikel sind, wird es nahe- 
liegend auch Neutronen von Elektronenmasse zu postulieren. 
Wenn die Zahl dieser «eichten Neutronen» im Atomkern immer 
gleich der Zahl der Kernelektronen ist, wird bei jedem f-Zerfall 
ein leichtes Neutron frei. PauL, von dem die Neutronenhypothese 
herrührt, erklärt in dieser Weise den kontinuierlichen Charakter 
der #-Strahlung, indem er annimmt, dass die Summe der Energien 
des austretenden Neutrons und Elektrons immer dieselbe ist, und 
so sowohl die Energie des Ausgangsstoffes, wie die des Zerfalls- 
produkts scharf sein kann. Die Anwendung einer (bisher unpubli- 
zierten) Überlegung von Bour zeigt, dass das Ionisationsvermögen 
der leichten Neutronen so klein sein muss, dass man sie mit den 
heutigen Mitteln nicht beobachten kann. 

Auch die statistischen und Spinanomalien der Kerne werden 
durch die leichten Neutronen erklärt. Da die Summe der Anzahl 
der Elektronen und der der leichten Neutronen in allen Kernen ge- 
rade ist, ändern sie weder die Statistik noch die Ganzzahligkeit 
des Spins. 

Wenn man dagegen voraussetzt, dass die Caapwicx-schen Neu- 
tronen lediglich Wasserstoffatome in einem bisher unbekannten, 
besonders tiefen Zustand sind, muss man auch die gewöhnlichen 
Formeln für die Übergangswahrscheinlichkeit aufgeben. Nach 
diesen müssten nämlich die H-Atome einer Verbindung durch 
Strahlung in etwa einem Monat in ein Neutron übergehen. 


(Aus der Sitzung der IIL Klasse der Ungarischen Akademie der Wissen- 
schaften vom 13. Juni 1932.) 


Über die paramagnetische Umwandlung 
von Para-Orthowasserstoff. III 


E. P. Wigner 


Zeitschrift für physikalische Chemie B 23, 28-32 (1933) 


(Eingegangen am 28. 6. 33.) 


Es wird die Umwandlung von p H, in o H, (und umgekehrt) beim Stoss mit einem 
paramagnetischen Molekül berechnet. Die Stossausbeute der Umwandlung (sofern 
die Energie beim Zusammenstoss ausreicht) ist umgekehrt proportional zur absoluten 
Temperatur und quadratisch abhängig vom wirkenden Moment. Der Minimal- 
abstand beim Zusammenstoss geht mit der — 6. Potenz in die Stossausbeute ein. 


Im Anschluss an die vorangehenden Untersuchungen der Herren 
L. Farkas und H. Sacuss£ soll im folgenden die Geschwindigkeit 
der durch paramagnetische Gase bedingten Umwandlung von Para- 
wasserstoff in Orthowasserstoff (und umgekehrt) abgeschätzt werden. 

Die besondere Stabilität des Parawasserstoffes beruht auf einer 
Symmetrieeigenschaft des quantenmechanischen Energieoperators für 
das Wasserstoffmolekül: Er bleibt nicht nur dann ungeändert, wenn 
man sämtliche Koordinaten der beiden Protonen vertauscht, sondern 
näherungsweise auch schon dann, wenn man ihre CArtesıschen Koordi- 
naten allein vertauscht, während die Spinkoordinaten ungeändert blei- 
ben können. Diese näherungsweise Symmetrie!) muss daher vom um- 
wandelnden Faktor beeinträchtigt werden (bei den vorangehenden 
Versuchen vom inhomogenen magnetischen Feld der stossenden para- 
magnetischen Moleküle) und derjenige Teil des durch das Feld be- 
dingten Zusatzoperators wird für die Umwandlung massgebend sein, 
der diese Symmetrieeigenschaft stört. Er entstammt offenbar aus der 
Wechselwirkung der Kernspin mit dem magnetischen Feld und lautet 


Hp (s17 9a (2, Yı21) + 83,0, (%2Ye 2,)) . (1) 


x=xyz 

1) Für das unangeregte H,-Molekül besteht diese Symmetrie viel genauer, 
als die übliche grobe Uberschlagsrechnung zeigt, die nur einen Faktor a2 im Koeffi- 
zienten (a ist die Feinstrukturkonstante), also a* in der Wahrscheinlichkeit des 
antisymmetrischen Anteils im Parawasserstoff ergibt. Dies rührt daher, dass jene 
Matrixelemente der Wechselwirkung von Kernspin und Elektronenbewegung, die 
zwei Rotationsniveaus des unangeregten Moleküls verbinden, verschwinden. Vgl. 
auch K. F. BONHOEFFER und P. HARTEcK, Z. physikal. Ch. (B) 4, 113, 127. 1929. 
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Hierbei ist xp das magnetische Moment des Protons, H (x14121) be- 
deutet das magnetische Feld an der Stelle x,, y,, 2, des ersten Protons, 
die s,, sind die PauLischen Spinmatrizen desselben. Die Grössen mit 
dem Index 2 beziehen sich entsprechenderweise auf das zweite Proton. 
Hierbei wird das paramagnetische Gas als ein rasch veränderliches 
-inhomogenes Magnetfeld beschrieben. Das ist eigentlich im Sinne der 
 Quantenmechanik nicht berechtigt, weil es sich hierbei keineswegs 
um ein äusseres Feld handelt und insbesondere die Rückwirkung des 
H,-Moleküls auf das paramagnetische Gas und ein eventuell vorhan- 
denes virtuelles Moment unberücksichtigt bleibt. Besonders im Falle, 
dass das paramagnetische Gas selber Orthowasserstoff ist, wäre aus 
diesem Grunde eine eingehendere Behandlung notwendig. Im all- 
gemeinen wird aber das Verwenden des en keine allzu grossen 
Fehler mit sich bringen. 

Wenn das Magnetfeld homogen wäre, so hätte (1) noch immer 
die Eigenschaft, dass es ungeändert bliebe, wenn man 2,, Yi, 2, mit 


Xe, Yo, Za vertauscht, ein homogenes Feld hat keine umwandelnde 
Wirkung. Deshalb liegt es nahe, für (1) 


l 
22 iD, G z T So „) (b, (x Yı 2,) ae b, (Xp Ya 25) (2) 
=, YF 
+ (812-824) (Da (419121) —D, (%2 Y222)) 
zu schreiben. Der umwandelnde Teil von (1) ist dann allein das zweite 
Glied in (2), das wir durch 


1 db, 
Q= 5 HPO (812-522) (eea) (3) 
x4 . 


ersetzen wollen. Um die Matrixelemente von Q zu berechnen, können. 
wir annehmen, dass das Magnetfeld in Entfernungen, in die die Pro- 
tonen kommen können, durch das Feld eines Dipols ersetzt werden 
kann, das sich im Molekülschwerpunkt befindet. Dann kann man bei 
entsprechender Wahl des Koordinatensystems näherungsweise 

ob, Bug? “kh 


on, ri 


setzen, wo u, das magnetische Moment des paramagnetischen Mole- 
küls bedeutet. Nunmehr kann man für Q auch 

234 U pi a l . . l . . 4 

(tg (etis) (vty) + y (Sais) (e+ iy) (4) 


schreiben, wo x, =2%,,—%,, und s,=8,,—8,, ist. Die drei Glieder von 
(4) geben alle zu verschiedenen Ubergingen Anlass, so dass sie 
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getrennt behandelt werden können. Es sei hier das erste Glied heraus- 
gegriffen, das Endresultat muss aber dann noch mit 3 multipliziert 
werden. Das Matrixelement, das den Zustand mit der Rotations- 
quantenzahl 7 und der magnetischen Quantenzahl 7 verbindet mit 
dem Zustand mit den Quantenzahlen lJ’ und 17’ 


Supu 
M, yt yy! 2 (y n? ns Sy Z Yr d | (5) 
verschwindet für n+#n'’ oder wenn J’ nicht /+1 ist, so dass y,, und 


Wry verschiedenen Wasserstoffarten entsprechen müssen. Das Spin- 
moment in der Z-Richtung muss für den Orthozustand Null sein, dies 


gibt einen Faktor 3 bei der Umwandlung von Ortho zu Para, wie 


er für das Gleichgewicht notwendig ist. Für die Umwandlung von 
Parawasserstoff in Orthowasserstoff fällt dieser Faktor weg. Man 
kann die Matrix (5) mit Hilfe der gewöhnlichen Intensitätsformeln 
berechnen, für ’=I!—1 ergibt sich 


3a,, u [Eee Te 
wem) 2 l 
M 1-1, = rå (22+ 1) (21 — 1) (6a) 
und für J’ =/—1 


SO la Up (2 
Msn ee J (QI 1) @l+3)’ (6b) 


WO Gy, der Abstand der H-Atome im H,-Molekiil ist. 

Nunmehr kann man die Umwandlungswahrscheinlichkeit direkt 
berechnen. Ein Parawasserstoffmolekül mit der Rotationszahl l stosse 
auf ein paramagnetisches Molekül, der Stossabstand sei a,, die Relativ- 
geschwindigkeit v. Wenn v so gross ist, dass die kinetische Energie 
des Stosses die Energie, die zur Umwandlung nötig ist, übersteigt, so 
kann man nach KALLMANN und LoxDon!) den Stoss ersetzen durch 
folgenden Prozess: Die Stosspartner werden einander unendlich rasch 
bis auf den Stossabstand genähert, die mittlere Zeit des Zusammen- 
stosses beisammen gelassen, dann wieder unendlich rasch getrennt. 
Während des Zusammenseins lautet dann die ScHRODINGER-Gleichung 
für den Entwicklungskoeffizienten c,,, des entstehenden Orthozustan- 
des mit der Rotationsquantenzahl +1, wenn man als Nullpunkt der 


Energieskala diesen Zustand wählt 


dc iy | 
. De, = ir,ıHıd 
thy = Minıtıne (7) 


Hierin ist % die PLancKsche Konstante dividiert durch 2 a und 
14, die Energiedifferenz zwischen den Zuständen Z und l+ 1 dividiert 


1) H. KALLMANN und F. Lonpon, Z. physikal. Ch. (B) 2, 207. 1929. 
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durch ?%, also A En , wobei J das Trägheitsmoment des H, ist. Die 
Lösung von (7) ist | | 
Barr Uatt pI KESU | _in(l+1)t 1) 
1 u ae 


GH = Raus) V ere eres (8) 


Hierin muss noch für ¢ die Zeit des Zusammenseins eingesetzt 
we . e . a * “ « 5 
werden, wofür wir die Zeit „ schreiben, in der das Matrixelement (5) 


von der Hälfte seines Maximalwertes zum Maximalwert ansteigt und 
dann wieder auf die Hälfte herabfällt. Durch Quadrieren und Mitteln 
über n erhält man so für die Wahrscheinlichkeit des Orthozustandes 
i-+-1 nach dem Stoss 

| 1207,40 upJ? , AlI) a 


> ieee er ne nenn Dur, 
a4? = Wak (+N QIF]1 69% (9a) 


und entsprechend fiir die Wahrscheinlichkeit des Orthozustandes mit 
der Rotationsquantenzahl /—1 


12a?, u? u% J? hla 
2_ Ihar P ae 8 
Jey)? = Pale i 63 (9b) 
Die Gleichungen (9) zeigen, dass man im wesentlichen nur mit dem 
Übergang 0 — 1 (und vielleicht noch mit 2— 1) zu rechnen braucht. 
Bei diesen kann man den Sinus durch den Winkel ersetzen und er- 
hält so für die Stossausbeute des 0 — 1- Überganges 


22 
2u? upd 


3A a8 Mv?’ uy 


sofern man nur diejenigen Stösse in Betracht zieht, deren 
Energie zur Umwandlung ausreicht. Setzt man für Mv?=3kT 
und auch für die übrigen Grössen die entsprechenden Werte ein, so 
erhält man mit a,=1 bis 2A Stossausbeuten von 10-4 bis 107%, 
in Übereinstimmung mit den experimentellen Werten. 


Was die Temperaturabhängigkeit anbetrifft, so ergibt der hier 
benutzte Mechanismus ebenfalls ein mit den Versuchen übereinstim- 
mendes Resultat. Zunächst hängt (10) explizite nur dadurch von der 
Temperatur ab, dass die Stossausbeute umgekehrt proportional zum 
Quadrate der Stosszeit ist. Diese würde eine der absoluten Tempe- 
ratur umgekehrt proportionale Reaktionsgeschwindigkeit bedingen, 
der aber ein Abfallen des Minimalabstandes a, (SUTHERLANDsche Tem- 
peraturabhängigkeit des Wirkungsquerschnittes) beim Zusammenstoss 
mit wachsender Molekülgeschwindigkeit entgegenwirkt. 
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Beim Vergleich mit der Erfahrung muss zudem noch in Betracht 
gezogen werden, dass die beobachtete Geschwindigkeit nur der Grössen- 
ordnung nach durch (10) wiedergegeben wird, in der Tat setzt sich 
diese aus der Summe der Einzelbeträge der verschiedenen Übergänge 
zusammen, die sich ihrerseits ebenfalls mit der Temperatur ändern. 
Die genauere Diskussion zeigt?), dass die schwache Temperaturabhängig- 
keit der beobachteten Stossausbeute auf ein Zusammenwirken dieser 
Faktoren beruht. 


Der hier vorgeschlagene Mechanismus scheint schliesslich auch 
die Abhängigkeit vom magnetischen Moment des wirkenden para- 
magnetischen Moleküls richtig wiederzugeben. 


1) Vgl. die vorangehenden Arbeiten von L. Farkas und H. Sacusse. 


On a Modification 
of the Rayleigh-Schrödinger Perturbation Theory 


E. P. Wigner 


Magyar Tudományos Akadémia Matematikai 
és Természettudományi Ertesitöje 53, 477-482 (1935) 


1. The RayLeisu— ScHRÖDINGER perturbation theory? gives 
an explicit power series in A for the characteristic values F,, and 
the characteristic functions øn of a Hermitean operator H+AV 


(H+-AV) on = Fnon (1) 


if the corresponding quantities En and Øn for the «unperturbed 
operator» H are known 


Ho, = Enn. (1a) 
If the so-called matrix elements of V are denoted, as usual, by 
Vam = (Pns Vdm) = == Vs mn (2) 
the first terms of the aforementioned series read 
x var 
ie = En + Vin + A” UE E; (3a) 
Pn = Yn + D E, E-E Pr. (3b) 


Generally only these first terms of the series are used in actual 
calculations, the higher terms become increasingly complicated. 


1 J. W. 8. RayreicH, The theory of Sound. London and New York 
1894, vol. 1, p. 113. E. ScHRÖDINGER, Collected papers on Wave Mechanics, 
London and Glasgow 1928, p. 64. 
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We shall fix our attention on the lowest energy value $`. 
While it is evident that the first approximation for this 
FO = E,+AV,, is always greater than its real amount — since 
it is the expectation value of a normalized wave function ¢,; 
nothing like this holds for the second and higher approxima- 
tions. It even happens quite often that the last series in (3a) 
diverges in cases when the lowest energy value is finite itself. 
In these cases, of course, the RAYLEIGH—SCHRÖDINGER perturba- 
tion theory is inapplicable to the problem. The aim of the 
present paper is to give an approximation formula for F, which 
always yields values that are too high, and which can be proved 
to converge at least in certain simple cases. Such an expression 
is naturally provided for by the variational method which had 
been used frequently indeed in cases for which the general shape 
of the characteristic functions could be obtained by physical 
considerations. | 

The final result, the co-th approximation, will appear in the 
form of an infinite series. This infinite series was first found by 
L. BritLLovin who obtained it by an intuitive consideration of 
the usual scheme. He has already pointed out in his important 
paper that his series converges much more rapidly than the power 
series of SCHRÖDINGER. He has not investigated, however, the suc- 
cessive approximations and their relations to the actual problem. 

9. For the sake of convenience we shall denote further on 
Fe = Ekr + AVin simply by Ek, assuming Vek =O or that the 
diagonal part of V is already put into H. In addition to this, 
we shall put à = 1. The expectation value of H + V— E, for the 


wave function & 
9, = Yı + È kpr (4a) 


is (R means that the real part of the ensuing expression must 
be taken) o 
È (Er— Li) | ax |*+ 2R 2 Vikar + 2 Vrıakaı 


I 7 — BE nun A Ba ee a ee es Erle er 
F — E, BL (40) 
k 


AL, BRILLOUINN, Journal de physique, 4, 1, 1933. The perturbed energy 
first appears as resonance denominator in J. E. LENNARD-JONES article, Proc. 
Roy. Soc. London, A. 129, 598, 1930. 
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where all summations must be extended, as always in this 
paper, from 2 to oo. Expression (4b) must be made to a 
minimum by choosing the a 's properly. 

To orient ourselves we proceed as follows. We neglect first 
the double sum in the numerator of (4b), i. e. assume Vy = 0 
for k >1, 1>1. Differentiation of (4b) with respect to a, 
gives then 


| ü = FOLT . (da) 
This, inserted into (4b), gives 
6 š Koral 
F? — E, = Daa (50) 


This is an implicit equation for F”, which can be solved e. g. 
by plotting both sides of (5b) against Fi”. It is an exact solu- 
tion of the problem if Vw = O (i. e. it is F’ % — F): the finding 
of the characteristic values of a Jacosr matrix.* 

It appears to be natural, now, even if the Vj. are different 
from zero, to try (5a) for aj’ but to use instead of (5b) an 
equation expressing again that I?’—F, is equal to the total 
fraction (4b), which, under the present conditions will no longer 
be equal to the right side of (5b). One obtains, instead of (5b) 


-E =) T Vix V HÈ TE EITE ET V 1k Viet Vu (6) 
FPE; (FR -E)(FP-E) 


The value of F, obtained from (6), since it is an expectation 
value of H+ V, is always 100 high. For the next approximation 
one may try 


2) 1 
ak = ay + Pi 


neglect second power terms in the double sum of (4b), then 
minimize the resulting expression. One obtains, by denoting 
the total fraction this time with F? — E, 


1 (5b) goes over into the usual formula of the RAYLEIGH — SCHRÖDINGER 
theory if E, is doubly degenerate, and E,, E, etc. are far away from E,—=E,. 
However, (55) may be used as it stands, without first solving a «secular 
equation». 
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Vina” 
ake =p T HATE E TOE 5 


which is in this Di equal to 


Vri Vu 
of =D pe: FE — Er EE + FP EEPE (Fi? — Ex) (Fe —E,)(F2—E) E) (7a) 


Inserting this into (4b) again, one obtains 


V VaV VuVu: V 
Fe + Sy TENE + 


a (F®— E) (Fe 2 Fo E) + (7) 
4+ p ar ae = V 


7 
This, solved by a graphical or other method, again gives cer- 
tainly too high values for T. 
It is evident now, how the higher approximations look. If 
one inserts 


alt) — = ah) ii > 7 Vase, un-ılin-2:. Vee Va 


EN (eT 


into (4b), the Pe side, which is the expectation value of a 
normalized wave function minus /,, becomes 


TH Tat + Tint Tones + (N= 1)(f- Ep = 


_ (ee (H+ V) oft”) " 
with | | 
N=1 + 2 ja + D|? (9a) 
and | 
Vin, Vi, joj Vuan Var 
t= a er 


Equ. (9) is an a If one chooses f such that 
Ta + T3 + + Tan+1 = f — E; (8) 
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one sees that the expectation value of the corresponding wave 
function is Just 


Pi 1) — f 


which can be taken as the (n+-1)th approximation. One can 
satisfy oneself easily, that the infinite series, obtained by gene- 
ralization of (8), together with a similar generalization of (8a) 
and with (4a), formally satisfie equation (1) with F =f. They 
can diverge, of course, in spite of this. It is true, however, that 
all equations of the form (8) give too high values for F}. One 
easily sees that the odd term of (8) are always negative,! 
(therefore improving on the energy value), while the even ones 
may be positive or negative. . os 

Of course, (8) converges in many cases in which the usual 
RAYLEIGH —ScHRODINGER method diverges. It can converge even 
in the case of a continuous spectrum which has been made 
artificially discrete by setting finite boundaries to an originally 
infinite problem. There is one, as I think, rather general case, 
in which it can easily be shown to converge. This is when H 
iS an «even» operator, which remains unaltered under a trans- 
formation Q, satisfying the equation Q*=1, and V an odd 
operator which goes over into —V under the same transfor- 
mation. Then Vw is zero if k and l refer either both to even 
states, or both to odd states and the even terms of the series 
(7) drop out. The conditions of .the convergence are in this case 
merely that H+ V should be bounded downwards (the lowest 
energy value finite) and applicable to all characteristic func- 
tions of H. 

Of course, (8), (8a), (4a) still give a formal solution of the 
problem, if one inserts other numbers instead of 1. It is not 
so easy, however, to discuss the resulting equation in these 
cases. 


1 The (2n—1)st term in (8) is equal 
Pag = Era)? (FE) (*) 


which is, of course, negative. Equation (*) is material for the proof of 
convergence in cases to be mentioned later 
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3. As an example, I want to discuss the MATHIEU-equation 


— lp 
Ar + sin x o = Foy (x) 


in which the total potential sin x will be regarded as pertur- 
bation. The e characteristic functions are 


j dies sin x | COs X es 
nu za = Va Yn’ Ya 


and the V;,; matrix is 


0 


1 
y2 


PI 0 -— o 0 
0 -+ 0 = 0.. 
0 0 5 0 -y 
0 oO 0 - > 0. 


The equation for the lowest energy is 
1 1 
Enn earay BUFFET) 
1 
301- FF FE Bu) = FPO- * 
This equation can be solved by successive approximations very 
easily, one obtains F = — 037856. 
Another example of a similar simple case will be given 
by Mr. F. Seitz in the calculation of the Fermi energy of 
metallic electrons. 


-+ 


+ 


(From the meeting of the IlIrd class of the Hungarian Academy of 
Sciences on the 12th November 1934.) 


Felelős kiadó: Wigner Jenő. — Franklin-Társulat nyomdája. 


Calculation of the Rate of Elementary Reactions 
of Light and Heavy Hydrogen 


L. Farkas and E. P. Wigner 


Transactions of the Faraday Society 32, 708-723 (1936) 


Received 15th Fanuary, 1936. 


1. 


The reactions which will be the subject of the present paper are as 
follows : 


kı 

Bes ma e s w aY 
k 

D- D, >D, +D 2. (3 
k 

D+H,>DH+H 2. (3) 
k 


HED HDD . . « wf 
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The first two reactions determine the rate of homogeneous thermal 
conversion of light and heavy para- ortho-hydrogen. The mechanism of 
the homogeneous thermal conversion of light para-hydrogen into ordinary 
hydrogen has been cleared up in an important paper by A. Farkas. 
The rates of reaction (2) together with those of (3) and (4) have been 
measured recently by A. and L. Farkas ? and compared with each other 
on the basis of the usual theory of collision numbers. In the present 
paper we shall compute the reaction rate of the elementary reactions (1) 
io (4) on the basis of more recent theories * 4-5 since, so far, these are 
the only examples providing both theoretical and experimental data 
for a comparison. 

It will be interesting to calculate theoretically the number of inter- 
changes which take place per unit time and volume (sec. litre) and unit 
concentration (mol./litre) of the reagents (H and H, e.g.). We designate 
as interchange a reaction involving atoms and molecules of the general 
type 

A+ BC =AB+C 
where A, Band C represent H or D atoms. The rate of interchange can 
be measured directly, in principle, in cases (3) and (4). In reaction (1) 
only the rate of conversion: 


H + p-H, > 0-H, + H ; : . (1a) 


can be measured directly. In the number of interchanges, as defined 
before, however, the reaction: 


H + p-H, > p-H, + H (16) 


should also be counted. The number of these is 1/3 of those which lead 
to the ortho-molecule, strictly according to their statistical weights.® 
The total number of atomic interchanges is, thus, 4/3 times the number of 
para- ortho-conversions. Similarly the number of interchanges in case (2) 
is three times greater than the number of ortho- para-conversions, which 
can be measured directly in this case.? 

The results for the rates of the first two interchange reactions are as 
follows : 


TABLE I. 
T. ` kı. Re. 
2837 1:3 X 105 — 
373° 3:1 x 108 — 
873 1:37 X 10°? 0:70 X 10° 
930 I'71 X 10° 0:53 x Io? 
973 2-00 X I0? I-02 X Io? 
1023 2°35 x 10° — 


1 A. Farkas, Z. Elektrochem., 1930, 36, 782; Z. physik. Chem. 1931, 10B, 
IQ. 

i 2 A. Farkas and L. Farkas, Proc. Royal Soc., A, 1935, 152, 124. 
3 Pelzer and Wigner, Z. physik. Chem., 1932, 15B, 445. 
+E. Wigner, tbid., 19B, 203. 
5 Eyring, J. Chem. Physics, 1935, 3, 107; E. Evans and M. Polanyi, Trans. 
Faraday Soc., 1935, 31, 875. 

S$ A. Farkas, ‘“ Ortho-hydrogen, para-hydrogen and heavy hydrogen,” Cam- 
bridge, 1935, p. 66. . 

7 K., H. Geib and P. Harteck, Z. phystk. Chem., 1931, Bodensteinband, p. 849. 
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The ratio of the reaction rates of (1) and (2) according to Table I. is about 
195. This differs somewhat from the ratio given previously 2 (which 
was 1-8) since the higher D atom concentration in 95 per cent. o- D, com- 
pared with that in ordinary D, is now taken into account. The increase 
of the H atom concentration in 50 per cent. p-H,, which was used in the 
experiments on the conversion p-H,->o0-H,, can be neglected. The 
increase of H and D atom concentcation in p-H and o-D, respectively 
is due to the comparatively low entropy of these gases. 

In the case of reactions (3) and (4) it is not possible from the present 
data, to calculate both k, and k, separately ; only one equation for k; and k, 
can be derived. The reaction investigated was the formation of HD 
from H, and D, which had the mechanism : 


H, = 2H : ' . . . (a) 
D 2D . 2 8) 
HD=H-+D 7 J i ‘ (c) 
DH -SDEEH s e o « @ 
5 
Rg 
H+D,HD+D . oo. . (9 


6 

The content of HD in the reacting mixture was measured as function of 
time under different pressures and temperatures. The velocity con- 
stants of the three dissociation reactions will be denoted by Raa Ra Ras 
that of the reverse, recombination reaction (which occurs at triple col- 
lisions) by Ry, Fy, k,e The constants of the reactions (3) and (4) are k, 
and k, in the sense from left to right, and k; and %, in the reverse sense. 
Since the number of atoms is certainly low compared to the number of 
molecules, the number of the reactions (3) and (4) is greater than that oF 
(a), (b) and (c) and the ratio [H]/[D] will be determined by them. 

[H] _ k, [Ha] + %,[HD) _ E (s) 

[D] ~ %,{DH] + kD] : 
The total.number of atoms will be given of course by reactions (a), (b) and 
(c), these giving their stationary concentrations : 


kalHı) + ba» [Ds] + Rael HD)? 
[H] = { Rea + kma? + Roet l ° (6) 


Now we assume that the same fraction of collisions of H + H and D + D 
and H + D leads to a recombination. This is, of course, only approxi- 
mately true. Thus &,, = kn. Furthermore k,, = 2k,., since the number 
of collisions between n atoms H and z atoms D is twice as great as between 
n atoms H. From the dissociation equilibrium we obtain : 


Raa __ . kæ an ; e 
Eee a» ky Te) hae e ° è s (7) 


which yields, inserted into (6) 
[H] + [P] = V K.{H,] + KD] + 2K.[HD] . - (8) 
With the figures for the dissociation constants at T = 1000° we obtain 


2 ı DJ, _z{HD] 
CH) + [D] NE DENIRO 
[HD]? 
[H [D] 


where K = K is approximately 4 for 1000° abs. 


140 Part I. Physical Chemistry 


L. FARKAS AND E. WIGNER 711 


The original concentration of H, will be denoted by c. Since the 
concentration of D, was in all experiments practically equal to that of 
H,, the momentary concentration are [HD] = 2cy, that of 


[HA] = [D,] = e(r — y) 
Thus we have finally the following equation : 


[H] + [D] = VK. V1 TEN: 706576157, ae 125VK.c (9) 
This means that the total concentration of atoms is hardly changing at all 
during the reaction. The numerical value for [H] + [D] is the same as 
hitherto used.” We assume furthermore that the ratio [H]/[D] is always 
k,/k, as in the beginning of the reaction. 

The kinetic equation for the rate of formation of HD is then 


2c = (R,[D] + 4,[H])c(1 — y) — (ACH) + 4.(D])2ey . (ro) 


Integrated, this gives 
a 
Yo ze le a . . . . (11) 


where a = }(k,[D] + &,{H]) and b = ($k, + kJ[D] + ($k, + k [H]. In- 
troducing the calculated values of [H] and [D] and expressing k, and kę 
by A, and k, and the equilibrium constant * of the reactions (3) and (4) we 
have . | 


125. VKycksk; 


a = k, + hy . . . . . . (12) 
1:25 V K,c 
b = Tok (kski + 0°38 kè + 0-672,7) . . (13) 


The time of half-change of the reaction is determined by b. According 
to the experiments ? this time of half-change is 1-5 times greater than that 
of reaction (1) at the same pressure and temperature (in the region 800°- 
r000°). Thus 
p= kı V K,2¢ | 
a ci : : : . (14) 


The factor 2 enters under the square root because at the same pressure the 
concentration of H, is twice as great in pure H, (as used for reaction (1) 
than it is in the mixture used for the reaction (3) and (4). 
Hence we have | 
Dr kaka + 0:38k,? + 0:67h4° 

O75 kı = ays i A . (15) 
which is the only equation for k and ką derivable under the above mentioned 
approximations from the present experiments. Into this equation we 
shall insert the theoretically calculated values of k, and k, and see whether 
it checks. 


According to the usual collision number theory, the rate of reaction 
should vary with the temperature 


datdm\? /8RT(M,+M,,) . 
in: 20( a Im A ee -WIRT 
k = 6:06 10%( A ) „MM, .S.eWIRT | (16) 


Here d, and dm are the diameters of the colliding particles (atom and 
molecule), Ma and M m their masses. S is a steric factor, W the heat of 
activation. For reaction (1) in the temperature region 873°-1023° abs. 


Calculation of the Rate of Reactions of Light and Heavy Hydrogen 141 


712 REACTIONS OF LIGHT AND HEAVY HYDROGEN 


one obtains for W = 5500 cals., S = 0:07. In the low temperature 
region the figures are W = 7000 cals., S = 0-13. Although this is not 
very unsatisfactory it is obvious that the formula, using rather crude 
assumptions of the kinetic theory of gases and containing two arbitrary 
constants S and W, cannot represent the true state of affairs, since a 
strict calculation of the reaction velocity constant must be based on the 
general methods of statistical mechanics. 


2. 


A real theory of reaction rates should contain no arbitrary constant 
at all. Attempts in this direction were made in the following way. 
London 8 first introduced the idea of the energy surface. The con- 
figuration of a set of atoms (the three H’s in (1)) can be described by the 
distances between them. The energy of the atoms in a configuration is 
thus a function of their distances 


E = F(riz 132) 


from which their motion during the reaction can be determined to a 
good approximation by classical mechanics. Classical mechanics can 
be applied because of the comparatively large masses of the nuclei. 
The equation of motion, e.g., in the x co-ordinate is 


pe? 2E 


It should be emphasised that E can be calculated only by quantum 
mechanics and that it contains the energy which the electrons have if 
the configuration is characterised by 19, 743, 23. Since this energy does 
not depend on the masses of the nuclei, the energy surface is the same for 
reactions 1 to 4. Of course there are, according to the different quantum 
states of the electrons, many such energy surfaces, but here, in case of 
chemical reactions, we shall be concerned with the lowest one only. 

In Fig. I which is taken from the work of Eyring and Polanyi? 
the contour lines of the energy surface of three H atoms are plotted for 
a straight configuration 73 = fiz + fa, atom 2 being in the middle. 
The angle between the axes rig and r,, is chosen in such a way that the free 
motion (motion under the influence of the gravity) of a small heavy ball 
on the surface of Fig. 1 corresponds to the motion of the atoms in a straight 
line.!% The valley on the right of the figure corresponds to a molecule 
formed by the atoms 2 and 3 (rə small) 1 being free. The valley towards 
the top of the figure corresponds to the molecule formed by the atoms I 
and 2, 3 being free. If the system performs the reaction 


HO) + H,(2» 3) > H,» 2) + 4%) 


the point representing the state of the system must go somehow from the 
top valley to the valley in the right. The dotted line represents one such 
path. The following is the picture given by London for exchange 
reactions. 


3 London, Sommerfeld Festschrift, S. Hirzel, 1928, p. 104. 

® Eyring and Polanyi, Z. physik. Chem., 1931, 12B, 279. 

10 Professor O. K. Rice has kindly pointed out that this angle is 60° for three 
equal masses, instead of 120°, since the sign in formula (25) Z. Physik. Chem., 1931, 
12B, 279 is incorrect. 


142 Part I. Physical Chemistry 


L. FARKAS AND E. WIGNER 713 


The energy of the system is the energy of the heavy ball and it is 
evident that for very low energies a passage from one valley into the 
other is impossible. Only if the energy of the system exceeds a certain 
limit—that of point X in our figure—is such a passage possible. Points 
x are the highest points of that path leading from one valley into the other, for 
which the highest point is as low as possible. Generally this path has only 
one highest point—its energy is the activation energy—only in our sym- 
metrical example there are two equally high points. The system in the 
activation point is called the ‘‘ actiwated complex” or simply “ complex.” 
It is evident therefore that the activation energy is given, if the energy 
surface is known, and this was aimed at in the pioneer work of Eyring and 
Polanyi.® It should be emphasised, however, that the present more 
accurate experimental data make more exact calculations desirable. Of 
course Fig. I represents, as already mentioned, the energy surface only 
for the straight configurations. For all configurations a three-dimensional 
surface would be necessary, but nothing would be changed fundamentally. 


°7 98 09 10 H 12 RB 19 vol 
Fic. 1. | 


If we consider a system of three atoms in a finite box, the energy 
surface must be cut off at a certain point corresponding to the size of the 
box. The ball on our energy surface would roll around for some time 
in one valley, would pass over into the other, etc. The number of 
passages per unit time is equal to the number of atomic interchanges. 

It is not easy to calculate this number exactly. Jt is quite possible, 
however, as we shall see, to calculate the number of passages over the lines 
through the activation points X, where the energy surface forms a saddle. Now, 
in our case not every passage over a saddle will lead to an atomic inter- 
change. After every passage through one of the saddles the ball will roll 
around in the little hole of the middle of the figure and after a certain time 
may go on into the second valley or else may return to the initial one. The 
relative probability of the first event will be the ratio of the number of atomic 
interchanges to the number of passages through the saddle. 

Although the number of passages through the saddles will be 

© 25 
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calculated sufficiently accurately this ratio can be estimated only. It is 
evidently rather near to $ in case the atoms 2 and 3 are equal, because 
the ball in the hole will go out in both directions with equal probability. 
If the atoms 2 and 3 are different (one D, the other H), though the poten- 
tial surface is unaltered by this fact, the passage through both the 
barriers is for some quantum theoretical reasons not equally probable 
over the two saddles. We shall return to this point later. In this case 
the number of atomic interchanges will be only slightly smaller than the 
number of passages over the saddle which is more difficult to cross. 


3. 


The calculation of the number of passages through a given saddle can 
be performed by the so-called transition state method.44 A small strip 
in phase space is considered perpendicular to the direction of the ball's 
motion during the reaction. The probability of the ball being in this strip 
can be calculated by ordinary statistical mechanics. In the classical 
theory this probability is given by 


— strip 
yi — 
vol. 


In the numerator the integration must be extended over the strip, in 
the denominator over the whole phase space, V is the (potential) energy. 
The number of passages is obtained by multiplying this quantity by 
the mean component of the velocity of the ball which is perpendicular 
to the strip, and dividing by the width of the strip. 

This classical theory must be corrected, of course, for quantum effects. 
There is the zero point energy, the quantum-mechanical leakage through the 
barriers and finally the jump from one energy surface to the other, which must 
be taken into account. 

The formula which takes the first two effects into account * and 1s valid 
for exchange reactions in which the activated complex is straight is: 


(Mat Mm\ Fl, (+ a _eigne 
k= y} MM m I > kT °° ' 1000 


x en By, : (1 se $8?v,") (17) 


Here, M, and M,, are the masses of the colliding atom and molecule 
respectively. J, is the moment of inertia of the activated complex, 
Im that of the molecule, Q the activation energy, N, the Avogadro 


number, B = T Vm is the vibrational frequency of the molecule which 


corresponds to 12,500 cals. /mole energy in H, and 8900 cals. /mole in D,, 


11 For Ne history of this method, cf. Evans and Polanyi, Trans. Faraday Soc., 
1935, 31, 575. 

_* This formula is identical with that derived by Pelzer and Wigner ? and 
Wigner,‘ and is practically identical with that of Eyring.’ Eyring’s formula, 
however, does not contain the zero point energy of the complex explicitly and also 
neglects the leakage through the barrier. 
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va is the deformational frequency of the complex and v, the other stable 
frequency of the complex, whereas v; is the imaginary frequency of the 
same: vw, can be calculated from the curvature of the potential surface 
in the same way as the real frequencies, except that the second derivative 
of the potential surface which is negative in this direction must be taken 
with the positive sign. k is the reaction velocity constant in mole/litre 
sec.; y is the above discussed ratio of the number of atomic interchanges 
to the number of passages through one of the saddles. 

The first line of formula (17) takes into account the classical quantities 
only, the second one contains the effect of the zero point energy of both the 
molecule and the complex and the effect of quantized vibrational levels. In 

I ea hy [2k 
2sinh By 1 — eWikT 


the numerator is due to the zero point energy $hv, the denominator is the 
sum of states for the vibrational states.* The last factor (1 + }B?»,?) 
contains the “ tunnel ” effect. In consequence of the negative curvature of 
the energy surface perpendicular to the strip, some passages will occur by 
leakage even if the energy of the ball is not high enough to reach the activation 
point. This manifests itself in the last factor virtually as a negative zero 
point energy of the activated complex, as it facilitates the reaction. 

The deformation frequency is doubly degenerate and vy enters as a 
squared term into (17). A straight 3 atomic molecule has 4 vibrational 
(va, Ya, Ys, V) 2 rotational and 3 translational degrees of freedom, al- 
together 9 degrees of freedom. 

If the energy surface of §2 is known all quantities entering into (17) 
may be calculated, thus (17) differs considerably from the formula of 
kinetic theory. The activation energy Q is the difference between the 
potential energy of the activation point and the bottom of the valley 
from which the reaction starts (without zero point energies). 

For the calculation of vibrational frequencies, the potential energy 
in the neighbourhood of the activation point must be known up to the 
second power terms of the distances. This will be assumed to have the 
form: 

V = Q + Prez — 0°85)? — g(r — 1:25)? + sd? 
=O+ pl —gPr— mts? 2. (18) 
the activation point having the co-ordinates 7.3 = 0°85 Å., m5 = 1°23 À., 
ô is the deformation angle 123. The fact that no (ria — 1'25) (rg, — 0°85) 
term enters is not evident, a priori; but can be seen from the figure, 
p, q, s are positive constants the values of which can be derived from the 
Fig. 1. 
"For the vibrations,1? v; and v, ô = O can be assumed and the molecule 
put into the x-axis. The kinetic energy can be written then if we denote 
by M,, M, and M, the masses of the three atoms 


A P f Mx, + M% + Myx 

= HM à? + Maža? + Mai?) = ut Zee For 
K 3 1%" 2 3%.) 2(M, + M + M) 

M (M, = Ma) (X, J x) | MM, + M o)(%3 = Xo)? 


+ — aM, F M, + MA 2(M + M: + M3) 
q MM ala — il, — &) (19) 
M, + M, + M, a 


* The vibrational states are considered as “ harmonic.” 
13 Cross and Vleck, J. Chemical Physics, 1933, 1, 350. 
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which gives, if one omits the first term, assuming that the centre of mass 
ıs at rest: 


K= ee). en M,(M, + Ma) A $ M,M3€n 
2(M, + Ma F My)" © (M, 4 Ma + M) '"M+M+M, 
= pé? + fuh? + pé) ; ) : 2. (20) 


If the last term were not present, the motion of the ball could be described 
in rectangular co-ordinates. 


The equations of motion are hence I S = — a etc 
ee) 2 2. a) 
BE + poy = + 29 
Assuming 
¿E = a? cos 2nvt, n = b? cos 2rv 
we obtain : . 
4nty? — pho — ey V (guy — pha)? + 49P (hal — u?) . (22) 


(uitz — p’) 
One of the two values of 47v? is positive, corresponding to the stable 
vibration v,, the other negative corresponding to 


Keal the unstable vibration ve 
The values for p and q can be estimated from 
24 the Fig. 2 which shows the two cross-sections of 


the saddle, p = 3:105 cals./mole Ä.?, q = 4-10? 

cals./mole A.2. For the reaction (I) this value of 

p and q gives Yıv, = 5300 cals./mole, #hv, = 1840 

13 cals./mole. There is no objection on theoretical 
grounds against the first figure. A glance at 
Fig. 2b shows, however, that a negative zero point 
energy of 1840 cals. is not possible in this case, 
since the energy—distance curve of Fig. 2 does 
22 not continue to be even nearly parabolic thus 
far. We have arbitrarily decreased ñv; to 1000 

cals./mole in the low temperature region. It 

| should be mentioned that it is not possible to 
24 justify this procedure at present, but it hardly 
effects our results. Eckart 1% has shown how to 

calculate the non-classical penetration of potential 

barriers of rather different shapes and Bell !* has 

used his results for 

20 the reaction H + Hg. 
b It must be men- 

tioned, however, 
that Eckart’s results 
apply to a one- 
dimensionalproblem 
only and the fact 
that Bell omitted to 
take the zero point 
energy hv, of the 
perpendicular vibra- 
tion into account 


49 


o] 09 09 10 ft 12 43 LY A 


Fic. 2. 


18 Eckart, Physic. Rev., 1930, 35, 1303. 
1¢ Bell, Proc. Roy. Soc., 1933, 139A, 466. 
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invalidates his results to some extent. On the other oa the method 
for many dimensional barriers * is applicable only if chee 2 are not great, 


i.e., in the high temperature region only. Here (17) has been used conse- 
quently, with hv; = 1840 cals./mole. Since the formula, though not 
sensitive for hv,, the tunnel effect being always comparatively small, is 
rather sensitive for 4hv,, the latter has been determined from the measure- 
ments (cf. section 4) and p calculated backwards. From this new p the 
thy, were obtained for the other reactions since the ratios of kv, and hv; 
are practically independent of the value of p. These ratios, as obtained 
with aid of formula (22), are listed in Table II. 

While the displacements are in the direction of the internuclear line 
for the vibrations v, and v,, they are perpendicular to this in the deforma- 
tion vibration vg. We shall denote these displacements by Yı Ya, Ya for 
the atoms I, 2 and 3, the condition that the centre of gravity is at rest 
and y = Q, is then 


My, + Mey, + My; = 0 - . . (23) 
We can put down the condition, furthermore, that the molecule has no 


angular momentum around the middle atom (in virtue of the previous 
equation this holds, then around any point) : 


My, 1:25 — Magy, 0°85 = 0 ; : . (24) 
The deformation angle 8 is 
Sad ae ae Ya — Y2 
2 =e + Sss 0°85 


and the potential energy sô?. The equations of motion are: 


is dV 256 
ee Sy = — Tg 


and similar equations for y, and y, which are automatically fulfilled, 
however, if we assume (23) and (24) from the beginning. These equations 
allow us to express y3 and y, by yı and thus d by y, alone. The equation 
of motion one obtains thus, is: 


SA = Yı . . . . . . . . a (25a) 


I Ey - I I \21 
ew ers Fe M, T O85? M, +(; T = A - (258) 


the deformation frequency is consequently 
I ae 
Va = a (e 4 . e . . . (26) 


The constant s should have, according to the calculated energy surface, 
the value 9000 cals. /mole. This gives hva = 2150 cals./mole for reaction 
(1). It should be mentioned, however, that the value of s is rather 
sensitive to small changes of the constants from which the energy surface 
is derived, and not too much stress should be laid on this value of kva. 

The ratios of vq for the different complexes are listed also in Table Il. 
in which the symbol means (e.g., D—DH) the activated complex of 
reaction (4) in which the H is nearer to the central D atom than the 
other D. 
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This completes the calculation of the frequencies. The ratio of the 
frequencies for reactions (1) and (2) is always V2, independent of the 
actual model assumed. 


TABLE I. 
Relative. 
Symbol. | jan e: 
Vs vi. Va 
H—HH . 5°4 1:77 2°43 3:71 X 10780 
D—DD . 3 8 1°25 1°72 7:42 x 19740 
D—HH . : ae 5'4 1'42 2°37 5°25.xX 107-49 
DH—H . ; 5 4°6 1°38 2:39 4°90 X 10-9 
H—DD k 3'8 I 60 Sr 4 go x 10749 
HD—D . 4°7 1-30 toI 5°52 X 107% 


The moment of inertia of the complex /, which is necessary to cal- 
culate the velocity constant according to formula (17) can also be readily 
obtained from the energy surface. As already mentioned the distances. 
of the end atoms from the central atom are 0-85 A. and 1:25 A. In the 
symmetric case there is only one momentum of inertia, in the asymmetric 
ones, the momentum of inertia of the complex is different in the two 
activation points. 

If x is the distance of the centre of gravity from the middle atom, the 
momentum of inertia is given by 


I, = M,(1:25 — x)? + Max? + M,(0°85 + x)? 
and 
ge M, 1°25 = M, 0°85 
M, + M, + M; 
I, does not depend very much on the dimension of the complex. The 


moment of inertia of the H, molecule is known from spectroscopical 
data. It is 0-467 x 107% g, cm.? and is twice as great for Dg. 


(27) 


4. 


In fact the energy surface for reactions I to 4 is not known at present 
with sufficient accuracy, especially not for the calculation of the constants 
for which (7) is sensitive, t.e., Q, v, and va. Therefore another procedure 
will be adopted in this section: a part of the experimental data will be used 
to calculate the constants of the energy surface relevant for these quantities. 

First of all we shall compare the reaction rates of (1) and (2) in the 
high temperature region. This ratio is, according to Table I, 1°65 at 
950° abs. and we obtain the following equation : 


kı sinh 3°29 er pana! sinh Bv V2 (1-+4(370CB)”) * 


Be = 1-05 == 23/2 - TE a ; 1 2 
ka © sinh 2-34 sinh ßv, sinh v, (1-+75(3700ß)?) 
(28) 


* This formula for the ratio of the velocity constants of the two isotopic 
reactions is essentially the same—if we neglect the tunnel effect—as the usual one, 
derived from the kinetic expression (16). : 

The difference in the “ over all ” activation energy of the two isotopic reactions 
is at low temperatures 0-15(hy,, — hv, — 2hva). 

[Continuation of footnote on next page. 
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if vgand v,are the constants of the complex HHH and 3700 cals., according 
to the calculation on page 715, v; for HHH. Since By, is certainly not 


greater than unity, we can put sinh x = x + 5 and thus we obtain 


finally : _ 
sinh Bv,/V’ 2 


sinh fn, > 


which gives By, = 1'9 and hy, = 7200 cals. /mole. 
In order to calculate the energy of the activation point Q we shall use 
the experimental figures for the temperature dependence of &,. From the 
figures of the Table I. yields 
dink, 


Twerp. = Tr = 3:4 to 3°5 ur . (29) 


d\n k 


whereas theoretically we obtain for the expression T aT from equation | 
(17) 
dink 1,0 
fe | + RT Pvm coth Bvm + 2ra coth Bva + By, coth Br, 
EEE LER 
a 09 


Inserting the values for vm, v, and v; and putting 2v4 coth Bva approxi- 
mately * 2:20 we obtain by comparison with the experimental value for 


Q/RT = 3-4 and Q = 6400 cals. 


According to this calculation the energy of the activation point is rather 
low and is not in good agreement with the calculated value? which is 
20,000 cals. However the first approximation calculations of chemical 
bonds are generally uncertain by this amount. The remaining unknown 
frequency va will be determined from the absolute value of the reaction 
rate of (1) at 950° abs. 


l 9 _ 24 x 10M 60 sinh 3-29 1:16 
£05 10 30.8 ne sinh 1:90 ` (sinh fva)? (31) 


This equation solved yields va = 0:74 and thus hva = 2800 cals. /mole. 


In the ratio of the two velocity constants the factor 


_ sinh By, sinh By,// 2 (se Bral 2) 2 
sinh ymi 2° sinh py, `’ sinh fva 
I 


I 
(hvm — hv, — 2hva) (= _ —) ae 
tends at low temperatures to e V2/ 2RT 


The factor 25/2 in the ratio a is partially compensated by the term 
2 
e& AR 2) 2 
sinh £r, ‘ 
and becomes (at high temperatures) V2. In the kinetic formula this VZ arises 


from the lower number of collisions in the heavier isotope. 
* For small fva, 2Pv, coth Bv, I 2. 
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5% 


In this section we shall compare the theoretically calculated values 
for k, and k, with the experiments and calculate the reaction rates of 
(1) and (2) at low temperatures. In the case of H + H, = H, + H the 
theoretical figures can be checked also by the experiments. 

k, and k, can be estimated on basis of the values of Q, vva and », 
obtained in ‘the preceding sections with aid of the Table II. In these 
cases, however, the molecular constants of the complexes in the two 
activation points are not equal, and consequently also the number of 
passages over the two saddles will be different. The number of passages 
over the two saddles of the reaction (3) is ence by 


(sinh 2730ß)? sinh 72008 
sinh 124008(1 + 4288028?) (, . 
(sinh 27508)? sinh 6140 ß 


(D — HH): n' = 3°6 x 108( -£ Aw. (32) 


3/2 
(DH — H) : n” = 36x 108(-4) 10-50 


l le 
5 ie 9 mo 
which yields for ną 2°95 X 10 litre see 
and for 
TREPA o _mole _ : 
n3 3:5 X 10 litre sec. ae 


Analogous formula for the reaction H + D, gives for the saddle (H — DD} 


jae Son 
litre sec.’ 
and for the saddle (HD — D) 
re agi 
litre sec. 


at the same temperature. 

The number of atomic interchanges which in the symmetric cases was. 
calculated with y = $ from the number of passages over one of the 
saddles, in the case of the reactions (3) and (4) can only be estimated. 
Since for n’ = n” y = }, and for the other limiting case if n’ > n” with 
y = I, k = n”, we shall put in both cases y = ł, and thus %, and k, 
become 


mole 
k, = 2'2 X 10? ——— 
litre sec. 
mole 
k, = 127 X 10° — 
4 7 litre sec. 


In order to compare these theoretical values for k, and k, with the ex- 
periments we have to insert them into the formula (15) of section 1. 
The right side of the formula yields 1:65 X 10? in agreement with the 
experimental value 1:4 X 10°. The agreement must be considered 
quite good, especially since the right side of the formula (15) is propor- 
tional to y, which was chosen rather arbitrarily $. Unfortunately, lack 
of more experimental and theoretical data about the reactions (3) and 
(4) makes it impossible for us at present further to check formula (15) 
for a wide range of H, and D, concentrations, and also for different 
temperatures. 
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It is remarkable that the velocity of reaction (3) is the fastest among 
all elementary reactions of hydrogen. This is caused by the compara- 
tively low zero point energy of the activated complexes D — HH and 
DH — H. This was predicted by Polanyi 15 on qualitative considera- 
tions. 

In Fig. 3 the energy diagrams for all the reactions (1 to 4) are sketched. 

In the initial state the zero point energy difference of H and D is neglected 
(the zero point energy difference of H and D is 84 cals., H having with 
this amount greater ionisation energy), in the activated state only the 
positive part of the zero point energy is taken into account. Qualitatively 
the energy levels of the com- 
plexes are in the same order HH-H 
like those which, in the paper DH-H 
of Farkas and Farkas were 
calculated on basis of rather 
crude assumptions about the 
force constants of the acti- 
vated complexes. The main 
difference in the absolute 
values of the zero point 
energies arises from the in- 
clusion of the deformation 
frequencies which were ne- 
glected in cf. 2. 

Much less 
satisfactory is gy 
the agreement 2 
between the 
theoretically cal- 
culated rates for D 


the reaction (1) 2. 
and the experi- 
mental figures at 
+ 
> 


low tempera- 
tures, as given in 
Table I. For the 
reaction (2) thus 
far no experi- 
mental data are 
known at low 
temperatures.* 

With the con- 
stants, Q, v, va obtained in section 4 and v; which was derived from 
the rough energy surface, the calculation of the rates of the reaction (1) 
is as follows : 

The tunnel effect in this region as mentioned on page 715 is sufficiently 
accurately given by (1 + 1/6(2000ß)?) which factor has at T = 283° abs. 
the value 1°52 and, at T = 373°, 1-30. At this temperature region sinh x 
can be replaced by łe” and we obtain, inserting the numerical values into 


(17), 
18 Polanyi, Nature, 1934, 133, 26. 


* According to the theoretical] formula (17) the ratio of k,/k, should at room 
temperature be 3:1. 


6200 


Fic. 3. 
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7 x 10% x 1:52e7283 RT = 6:9 x 10° ———— 
7 > 9 litre sec.’ 
e690: mole 
5'0 X 1010 x 1-30e7373 RT = 8-8 x 106 ——_—_ , 
litre sec. 
l : mole 
instead of the experimental figures of 1:3 X 10 -———- and 
litre sec. 
g mole 

S'I X 10° en at the corresponding temperatures. 


The 6600 Cal. which appears in the exponential factor is the “ over 
all” activation energy at low temperatures. This energy W is in this 
region 


W = Q — Yır. + hva + thy,. 


Theoretically, 2, is thus given by: 


Mo+M,,\%21, A w 1 /500\ 2 
=6 20 (ata m\ Le TUR g- =.(2 7, a 
k =6:06 x 10 x ( MM, ) Re: e ie AN F ) ) (34) 


T enters into this formula, besides the exponential factor essentially in 
the —4 power, in contrast to the kinetic formula, and also to the 
theoretical temperature dependence at high temperatures. 

It is at present impossible to tell whether this discrepancy is caused 
by some errors in the experiments or by the imperfection of the theory. 
It must be emphasised, however, that it is impossible to fit both the low 
and the high temperature measurement on the reaction rate of (1) to 
the theoretical calculations by alteration of the constants v,„ Q, va of 
section 4. 

If one uses the low temperature measurements, instead of the tem- 
perature dependence of k, at high temperatures, to determine the 
vibrational frequencies, one obtains the following set of values: 
hv, = 10600 cals./mole, hvg = 1500 cals./mole and the activation energy 
becomes 7800 cals./mole. With these constants one can fit all measure- 
ments, except, however, the temperature dependence of k, and k; in the 
high temperature region. 

Possibly the calculation of Q from the temperature dependence of 
kı at high temperatures according to equation (30) is affected with the 
greatest inaccuracy. The temperature dependence of k) is namely 
derived from the data about the thermal conversion of p-H, — o-H,. 
The velocity of this reaction varies very much, since the thermal H atom 
concentration is enormously increased with rising temperature, and thus 
the directly measured figures on the temperature dependence of the 
reaction p-H, —> 0-H, show a more than ten times higher temperature 
coefficient than that of k itself. In k, for instance, the increase of the 
thermal H atom concentration has been taken already into account. 


In this connection it should be mentioned that the experimental figure for 


iT fe is practically the same as for reaction (1), namely 3°5, whereas 


* By dividing this formula by the kinetic formula (16) (if we neglect the tunnel 
effect) we obtain for the steric factor the complicated expression 


_Ma + Mm L h? 
S= -MM TFT 
Hence S is, in the low temperature region, proportional to T-1, 
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according to the formula (30) the theoretical value with the respective 
values for the molecular-constants of the complex DDD yields 4:12. 
This discrepancy may be due to the state of affairs just mentioned, 
which holds also for reaction (2). A small uncertainty in the larger 
temperature coefficients of the thermal ortho-para-conversions causes 
a comparatively large uncertainty in the temperature dependence of 
reactions (I) and (2). 

If we assume that the measurements at high temperatures on the 
reactions (I) and (2) are fairly correct, the discrepancy at low temperature 
between the experimental data and the theoretical rate must be ascribed, 
as already mentioned, either to errors in the experiments or to some im- 
perfection of the theoretical treatment. It is planned to revise the 
experimental data in a wide temperature range for both reactions, since 
it seems very important to decide whether the present theory of reaction 
velocities, as developed recently, is correct or not. 


Summary. 


The velocity constants of the elementary reactions of hydrogen and 
deuterium 


HFHH H o . e o W 
D+D,>-D,+H l ; f . (2) 
D+H,>DH+H : : : . (3) 
H+D,-HD+D : ; ; . (4) 


are calculated on basis of the formula 
= M, + Mn ie I, h\? 2m -OIRT N, 
b= vix ( MM, ) Er (=) aX quer Ar 
sinh Bum : 
“Ginh pvp? sinh By, Tr) 


which was derived by Pelzer and Wigner and Wigner for this type of 
reaction. The constants »„ and Im of the molecule are known from spec- 
troscopic data. The molecular constants I, v,a; for the different 
complexes involved in reactions (I to 4) are partly computed by the energy 
surface of these reactions and partly obtained by comparison of one part 
of the experimental data with the theory. The remaining experimental 
data on reactions (1 to 4) are then compared with the calculated velocity 
constants. | 
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On the Rate of Chemical Reactions 
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From less than 92 known elements, 
about a million compounds have been 
prepared by chemists. More than half 
of these compounds contain only four 
elements: carbon, hydrogen, oxygen and 
nitrogen. How is such a great number 
of eombinations possible ? 

Certainly there is something strange 
about this. If we mix two elements, e.g., 
sodium and chlorine, only three different 
‘‘eompounds’’ can arise; if we have more 
sodium than chlorine, practically all the 
latter will be bound to form rock salt and 
the remainder of the sodium will be left 
intact. If we have more chlorine than 
sodium, chlorine will be left over in addi- 
tion to the rock salt. There are only 
three compounds: sodium, rock salt and 
ehlorine. What an enormously different. 
situation we encounter if we consider 
now the compounds containing only car- 
bon and hydrogen! About ten thousand 
distinet compounds of these elements are 
known. The actual number of such com- 
pounds is probably only limited by the 
C and H atoms available to be built into 
the compounds. Certainly at any par- 
ticular temperature not all these com- 
pounds are stable. At ordinary tem- 
peratures, methane is the only hydro- 
carbon which is stable, in addition to 
which we can have excess of carbon 
(graphite or diamond) or hydrogen. No 
two pure solids of the same composition 
can be stable over a range of tempera- 
ture. 

Why is it, then, that although a ben- 
zene molecule could break up into three 
acetylene molecules, pure benzene re- 
mains unchanged for an indefinite length 


of time? However, if one adds some 
suitable compound (a catalyst) the sys- 
tem proceeds very rapidly to equilib- 
rium, the benzene decomposing into 
acetylene. 

It obviously is the slowness of most 
reactions which allows this almost end- 
less variety of organic substances to 
exist. This indeed is what makes life 
possible and prevents our immediate and 
complete combustion in the oxygen of 
the air. No wonder that the beginnings 
of the study of reaction rates are lost in 
antiquity. 

The first measurement of a reaction 
rate has been made by Wilhelmy in 1850 
and the problem of the mechanisms by 
which chemical reactions proceed has 
been a center of interest in chemistry 
ever since. Van’t Hoff, in his famous 
“Étude de dynamique chimique,’’ recog- 
nized elearly the dynamical nature of 
chemical equilibria. That is to say, 
whenever a reaction like 


2C.H, = CH, + C,H, 


comes to equilibrium, this means that 
the number of ethane molecules being 
formed in unit time is just balanced by 
the number of such molecules combining 
to give methane and propane. The well- 
known rapid increase of reaction rates 
with temperature results from the fact 
that only abnormally violent collisions 
ean lead to reaction and the number of 
such collisions increases rapidly with 
temperature. The necessity of high 
energy concentrations for reactions was 
pointed out by Arrhenius toward the end 
of the last century. 


i nn 
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At the present time, we believe that the 
fundamental features of the mechanism 
of reactions are well understood. This 
has been possible because of the pioneer- 
ing work of many brilliant men, only a 
few of whom we can mention here. W. 
C. McLewis was the first to caleulate the 
number of violent collisions. To the pio- 
neering work of Marcelin, Polanyi, Bron- 
sted and Herzfeld we owe the gradual 
development of the notion of the acti- 
vated state. Rice, Ramsperger and Kas- 
sel completed the theory of unimolecular 
reactions along the lines suggested by 
Lindemann and Hinshelwood. Lewis 
and Smith, Daniels and Trautz applied 
kinetic theory to monomolecular proc- 
esses and did much in the way of study- 
ing systematically many reactions. Later 
a considerable portion of reactions were 
found to go faster on the walls, and the 
understanding of these catalytic proc- 
esses have been conspicuously advanced 
by the work of I. Langmuir and that of 
H. S. Taylor. 

The rest of the development is an in- 
teresting example of the progress and 
obstacles and the way they are overcome 
in the advancement of science. As a mat- 
ter of fact, we shall see that the calcula- 
tion of absolute reaction rates is a simple 
example of the application of statistical 
mechanies. It is based on the concept 
that the atoms are moving under the in- 
fluence of ordinary forces which are 
also responsible for the chemical valence. 
The difficulty of physicists in grasping 
this situation was that they believed on 
the basis of the older quantum theory 
that chemical reactions involved rather 
mysterious quantum jumps. Tolman 
and Herzfeld derived formulas very 
similar in principle to those which we 
shall diseuss, on the basis of this theory. 
Chemists were not bothered by the notion 
of quantum jumps but were less familiar 
with the methods of statistical mechan- 
ics. As a matter of fact, Rodebush and 
Rice and Gershinowitz derived formulas 
which are similar to the ones to be dis- 


bo 


cussed. They, like earlier workers, how- 
ever, did not make the fullest use of our 
present knowledge of the nature of forces 
acting during a chemical reaction. 
When the notion of ordinary forces 


' acting between atoms was reestablished 


by the work of London, it soon became 
clear how the considerations just men- 
tioned can be formulated on a general 
basis. This is all that has been done by 
Polanyi and Pelzer in collaboration with 
the present writers. 

A nice way of representing the motion 
of the atoms is by a diagram in the ‘‘con- 
figuration space.’ Suppose we have 
three atoms moving with respect to each 
other in a straight line. The configura- 
tion space is two-dimensional then, the 
X coordinate being the distance of atoms 
l and 2, the Y coordinate the distance 
between 2 and 3. Every point in this 
configuration space eorresponds to a con- 
figuration of the three atoms. The forces 
between the atoms can be derived from 
the potential energy for this configura- 
tion. If we make a landscape over the 
configuration space such that the height 
at any point is equal to the potential 
energy for this configuration, a ball roll- 
ing on this landseape will represent the 
motion of the three atoms under the in- 
fluence of the forces between them. The 
relative position of the atoms will change 
in reality in such a way that the corre- 
Sponding point in configuration space 
always coincides with the position of the 
ball. If we are interested in a system 
of more than three atoms or if their 
motion is not restricted to a line, we must 
use a configuration space of more di- 
mensions, 

Stable chemical compounds correspond 
to low regions in our landscape. If the 
ball is in such a low region and has little 
velocity, it will stay in this region for- 
ever. There may be several regions of 
comparatively low energy, corresponding 
to several apparently stable groups of 
molecules. A reaction will then consist 


in the passing over of our ball from one 
low region to another. 

For such a passage, it needs, first of all, 
enough energy. The average amount of 
energy which such systems have is pro- 
portional to the temperature. However, 
at any particular temperature, some 
systems will have less than the average 
energy and a very small number, very 
much more energy. Only the systems 
with exceptionally high energy will be 
able to pass from one low region to the 
other, and the fraction of the systems 
which have this unusual amount of 
energy increases very rapidly with the 
temperature. This fact accounts for the 
rough empirical rule that the reaction 
rate doubles with a 10° increase in tem- 
perature. 

In order to make an actual calcula- 
tion of the reaction rate of the reaction 
H, + J,=2HJ, say, the fundamental idea 
of Gibbs may be utilized. Imagine a 
reaction vessel with a great number of 
H, and J, molecules. 
the vessel into small compartments, each 
containing one single H, and J, molecule. 
The number of collisions between H, and 
J, molecules will be still the same as in 
the original vessel. However, every H, 
can react with one J, only so that we 
need to consider the configuration space 
of one pair of molecules only, instead of 
considering the coordinates of all the 
molecules simultaneously. 

The instantaneous state in each com- 
partment can be represented by a point 
in the configuration space. There will 
be one point in the configuration space 
for each compartment. As the molecules 
move in each compartment, the corre- 
sponding points in the configuration 
space will swarm like people in a moun- 
tainous region. To begin with, only one 
valley in configuration space is popu- 
lated. The people scurry around ap- 
parently aimlessly. Most of them have 
too little energy to rise much above the 
floor of the valley. Even those who have 
enough energy to emerge from the crowd 
will go uphill at any arbitrary place and 
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Let us subdivide 
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only a few of the lucky ones will strike 
the path that leads into the neighboring 
valley. The number of these successful 
ones is all that concerns us. Their num- 
ber gives us the number of reactions in 
our original vessel. 

We can count these lucky people by 
multiplying their density at the top of 
the pass with the velocity with which 
they are traveling. Account must be 
taken, of course, of the fact that some of 
them which passed the crest of the hill, 
having encountered some obstacle, return 
without having descended into the new 
valley. | 

Both the density of the people in the 
pass and their velocity are given by 
standard formulas of statistical mechan- 
ics. Indeed, the velocity depends only 
on the temperature and the mass of the 
atoms involved. The density depends 
only on the temperature, the density of 
population in the valley and the height 
of the pass above the valley floor. Thus 
the whole reaction rate depends only on 
the nature of the landscape in the imme- 
diate neighborhood of the pass, the na- 
ture of the valley floor and the tempera- 
ture. It has practically nothing to do, 
however, with the intervening country. 
The paths leading from one valley to the 
other may traverse many lower passes 
and intermediate valleys. All this will 
have little effect on the density of the 
highest pass and thus practically no ef- 
fect on the reaction rate. 

The intermediate valleys correspond 
to the intermediate compounds, fre- 
quently isolated by skillful chemists. 
These give us important information on 
the topography of the landscape. The 
mere fact that they can be isolated shows, 
however, that they inhabit low valleys 
and not the critical pass where density 
determines the rate of reaction. 

Of all the quantities entering into the 
calculation of reaction rates the height of 
the pass above the valley is responsible 
for the greatest uncertainty. Except for 
a few cases, so far, it has always been 
necessary to derive this value from the 
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Fig. 1. THE CIRCLE P IN THE FIGURE ABOVE 
CORRESPONDS TO THE LINEAR CONFIGURATION OF 
THE THREE ATOMS AS INDICATED BELOW. THE 
FIGURE ABOVE REPRESENTS THE ENERGY SURFACE 
FOR ALL LINEAR CONFIGURATIONS. THE HEIGHT 
OF THE SURFACE ABOVE THE PLANE OF THE FIGURE 
IS CHARACTERIZED BY THE CONTOUR LINES, AS IS 
USUAL ON MAPS. 


experiments on reaction rates themselves. 
An improvement of our knowledge of the 


PHOTOGRAPHIC PICTURE OF THE LAND- 


Fig. 2. 
SCAPE OF Fic. 1, TAKEN FROM THE WORK OF 
C. F. GOODEVE. 


landscape is highly desirable, therefore. 
This will be achieved, no doubt, in two 
different ways—partly by a further im- 
provement in the theoretical calculation 
of such surfaces, partly also by experi- 
mental investigations from which em- 
pirical rules can be derived. The ques- 
tion of absolute rates thus reduces to the 
construction of the appropriate energy 
surfaces, after which it becomes a nple 
problem of arithmetie. 


Calculation of the Rate 
of Elementary Association Reactions 


E. P. Wigner 


Journal of Chemical Physics 5, 720-725 (1937) 


(Received June 18, 1937) 


An upper limit for the rate of association reactions is found by determining the probability 
of a decrease of ihe relative energy of two atoms below zero energy, under the influence of a 
third body. The relation of this approximate calculation to the rigorous solution of the problem 
is discussed in Section 2. The results are applied to the recombination of J atoms, measured 
by Rabinowitch and Wood. Numerically, the agreement is quite good; however, the calculated 
values are somewhat too low, which cannot be explained by an inaccuracy of the method. 
Reasons for the discrepancy other than the possible nonadiabatic character of the reaction are 


discussed. 


1 


HE transition state method! allows the 
calculation of the rate of reactions which 
involve an activation energy. The assumptions 
which one has to make in order to apply this 
method are (1) that the reaction shall not involve 
a jump in the quantum state of the electrons,? 
1 Cf. for previous literature e.g. M. Evans and M. 
Polanyi, Trans. Faraday Soc. 31, 876 11935), 
? The possibility of chemical reactions without quantum 
jumps in the state of the electronic system has been first 


realized by F. London, Sommerfeld Festschrift (S. Hirzel, 
1928), p. 104. 


(2) that classical mechanics be applicable to the 
motion of the nuclei (or that the ordinary way of 
taking into account quantum effects be justifiable) 
and, (3) that the number of systems crossing the 
activated state be the number of systems re- 
acting. This last condition is always satisfied at 
sufficiently low temperatures and if the energy 
surfaces are not too complicated, the temperature 
at which their validity ceases will be rather 


high.” The common feature of the reactions to 


3 Cf. for a closer discussion of these conditions, E. Wig- 
ner, Trans. Faraday Soc. (1937). 
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which the transition state method can be applied 
in its usual form* is that the transition over a 


state with high energy betheratedetermining step. 


However, some of the most simple reactions, 
like the association of atoms to a molecule in a 
three-body collision, cannot be calculated by the 
present form of the transition state method, 
although its original formë embraces these reac- 
tions also. The purpose of the present note is the 
derivation of a formula for the rate of reactions 
of the type 2/-+A =J2+A, or 


Ait A+A =A A+A. (1) 


The assumption of the validity of classical 
mechanics which will be necessary in the present 
derivation also, is better fulfilled than for the 
reactions with activation energy. Also the exact 
shape of the energy surface enters much less 
critically, because there is no activation energy 
Q, and a calculation of the absolute rate is 
possible at least approximately. On the other 
hand, there is no limiting case in which the 
formula (11) to be derived here is a consequence 
of well-established principles of statistical me- 
chanics. Such a limiting case existed for reac- 
tions with activation energy as that of very low 
temperature and sufficiently heavy masses to 
assure the validity of classical mechanics. 
Eyring, Gershinowitz and Sun® have given a 
very ingenious method by which the rate of 
(1) can be rigorously calculated if the energy 
surface can be divided into two regions, both of 
which are developable. This is very nearly true 
for that part of the energy surface for the 
H+H+H=H.2+4H reaction which corresponds 
to the straight configuration of the atoms. 
However, it cannot be claimed that the majority 
of associations will proceed from a nearly straight 
configuration. For a bent configuration, the 
limits of the integration of Eq. (7) reference 6 
should be changed and this will increase the 
probability of association for bent configurations. 
Eyring, Gershinowitz and Sun® tried to compen- 


4H. Eyring, J. Chem. Phys. 3, 107 (1935); M. G. Evans 
and M. Polanyi (reference 1). 

ë M. Polanyi, Zeits. f. Physik 1, 90 (1920). The first 
complete outline of the theory has ‘been given in R. Tol- 
man's Statistical Mechanics (Chemical Catalog Co., 1927). 
Cf. also K. F. Herzfeld, Kinetische Theorie der "Wärme 
nee Handbuch der Physik, second edition, 

t H. Eyring, H. Gershinowitz, C. E. Sun, J. Chem. Phys. 
3, 786 (1935). 


7 This does not preclude the possibilit 
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sate for neglecting to change the limits of their 
integral by considering the neighborhood of the 
isosceles triangle configuration also. Thus they 
counted some reactions twice, some not at all. 
This criticism of the very ingenious work of 
reference 6 should not be taken, of course, too 
seriously, since all that was claimed, and all that 
can be done at present, is to obtain the reaction 
rate within a not too large factor. That certainly 
has been accomplished by Eyring, Gershinowitz 
and Sun. The point was brought up partly to 
illustrate the point that an exact calculation of 
the rate of (1) is not feasible at present’—not 
even under simplifying assumptions for the 
energy surface—and partly to jusuty taking up 
the subject here again. 


2 


Theoretically, of course, one could obtain the 
rate of (1) by considering the macrocanonic 
assembly of systems, containing an A, an As 
and an A; atom in a great box of volume v. One 
can consider then those systems which are in the 
dissociated state at present but will go over into 
the right side of (1) within a time £. Whether a 
point in phase space corresponds to the left or 
right side of (1), may be ambiguous if all three 
atoms are close. However, if t is sufficiently large, 
this ambiguity will extend to a very small 
proportion of all the systems reacting, most of 
the molecules formed will be sufficiently far away 
already from the third body to leave no doubt as 
to their associated character. On the other hand, 
t shall not be taken long enough to allow for a 
second collision, i.e., £ shall be small compared 
with the time between collisions but very long 
compared with the collision time. 

The systems which will react before i will be 
contained in a region R; of phase space. The ends 
of this region may be a little uncertain, but the 
total probability of R; will be well defined. The 
of calculating 
the absolute rate of (1) more accurately than can be done 
in. practice for reactions with activation energy. Although 
the transition state method should give nearly exact re- 
sults for the latter, the parameters of the energy surface 
enter in such a critical way into the formulas that a 
calculation of the whole rate is hardly feasible at present. 
Cf. also reference 3. 

8 The second section serves only to establish the connec- 


tion between the (impracticable) exact calculation of the 
rate and the approximate calculation of the ensuing 


sections. 
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rate constant k is this probability, divided by ¢. 

As we increase t (keeping it still below the time 
between two three-body collisions), Rı will 
assume the form of a longer and longer tube in 
phase space. We can put a surface S across this 
tube and k will be the number of systems 
crossing S. It does not matter where the surface 
S lies, it only must have no wrinkles which could 
be crossed by the same system twice. All the 
systems will cross S in the same direction. 

The number of decomposing systems can be 
counted in a similar way. Furthermore, the 
continuation of S may be taken for the surface 
for this counting. If we do this, and still allow no 
wrinkles in S, the number of systems crossing it 
in one direction will be the number of associa- 
tions, the number crossing it. in the other 
direction, the number of dissociations (which, nat- 
urally, must be equal to the number of associa- 
tions). Apart from having no “wrinkles,” S can 
be quite arbitrary as long as it divides the whole 
phase space into two separate parts: those 
corresponding to the left and the right side of (1). 
If the distance of A; is very great from A, and 
Aa, S must be the H)=0 surface, where A, is the 
mutual energy (potential and kinetic) of Aı 
and Ao. 

If the distance of the third atom is small, and 
one still would take Hy)=0 for S, this S could 
possibly show wrinkles. These wrinkles would 
correspond to the relative energy of Aı and Ae 
falling below zero and rising again above this 
value during the same collision. No association 
would result then, although, counting the 
crossings through S in one direction, one would 
count these crossings. However, one can define an 
Hy depending on all coordinates, such that Ho=0 
should have no wrinkles, i.e., no path in phase 
space shall cross it more than once during one 
collision. This Ho will be equal to the relative 
energy of A; and A; for great distance of the As, 
will be different from this, however, if A; is near 
to A, and Az». It will divide the whole phase space 
in two distinct parts and it will have the property 
that the total number of paths crossing it is as 
small as possible. This latter condition does not, 
of course, determine S (and even less Ho), but it 
determines that minimum number, which is the 
reaction rate k. 
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We shall calculate now the number of systems 
crossing the Ho = 0 surface in unit time. No special 
assumption concerning the position of this 
surface will be made in this section and the 
formulas (3) and (5) hold, therefore, quite 
generally. In the later sections, however, 77, will 
be interpreted (for every position of A;) as the 
energy of A; and A; in the coordinate system in 
which their center of mass is at rest. Formula (11) 
will give, therefore, only an approximate value 
(an upper limit) for the rate constant k. 

If H is the total Hamiltonian, we have 


dHo 0H, a 0H» dp; 


ðq: dt Kap di 
ôH ðH 0H, 0H 


— (2) 


ðq: OP; OP; 9 
0H 0(H— Ho) 0H, =") 
ap; ap; ; 


Those systems will cross the Họ=0 surface 
in the decreasing sense, which are nearer than 
(dH)/dt)/|grad Ho| from it, provided dH)/dt is 
negative. We have, therefore (Z is the total 
integral of e=#!*T over phase space) 


00; ôq; 


v? i dHo/dt 
Jerta, (3) 
ZJ |grad Hol 


where do is a surface element of the Hi=0 
surface and the integration has to be extended 
over the region in which dH)/dt <0. 

This is simply the derivative of the integral 


E=- f- +f Hoane? 


Ho<E 
dHo/dt<0 


-dqi° . -dqndqu° š -ddn (4) 


with respect to E at the point E=0 


adl(E) 
UB) g 
ZN dE / m 


[The real k is (rigorously) the minimum value of 
this expression for such Ho(qi, ++, Ga; Pur ***, Pa) 
which go over into the proper energy of the two 


associating particles for a large distance of the 
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third body but are not everywhere necessarily 
equal to it. ] The integral I(E) is to be extended 
over that region of phase space in which Ho <E 
and also (2) is negative. 


4 


As an example, the rate of a simple reaction 
(1) shall be calculated where all A are atoms. The 
approximate Ho which shall be used will be (for 
every distance of the A;) the proper energy of the 
associating atoms. We shall use in what follows 
the following coordinates: the components the 


vector qız pointing from A; to As, the com-' 


ponents of q; connecting As with the center of 
mass of A; and A», and finally the components of 
the vector Q pointing to the center of mass of all 
three atoms. The corresponding momenta are 
Piz, Pa, P ’ 


1 
P12? + Vo, (6) 


m, 


H= 


where m, is the relative mass of the associating 
atoms 1/m,=1/mi+1/m2 and Vo(rie) is the 
potential energy V(rız, 713, r23) for ri =rz= ©. 
Thus 


1 1 
H—H)=—ps’+—P’?+V— Vo, (7) 
2u 2M ; 


where 1/u=1/(mı+ms)+1/ms and M is the 
mass of all three atoms together. 
We have then by (2) 


dHy 1 
— = ——(Pie, gradi: (V- Vo)) (8) 
di Mr 


and, 


IE)= f be f (1/m,) (pis, gradıe (V— V2) 


Ho<E 
‚ed Tdg12dgad QdpidpsdP. (9) 


The integration over Q gives the total volume v 
that over p; and P gives (2rukT)?(2r MkT')}. One 
can perform an orthogonal transformation for pie 
making pi, parallel to gradı2 (V— Vo). The con- 
dition dH,/di <0 means that the scalar product in 
(9) is positive, i.e., that pie. has the sign of the 
grad. The conditions Ho <E means pizt Pin,’ 
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+ $12.2 <2m,(E- Vo). Thus (9) becomes 


KE) = (2rkT) (aM) im- f TE f das f dpi: 


Ho<£, Pızr >0 


-P12.|gradı2z (V— Vo) | 
-exp (—pı2/2m,kT— V/kT). 


The integration over fiz can be performed 
immediately and that over pio, and fıs: also if 
one introduces polar coordinates in the p112: 
plane. One obtains 


I(E) = (2rkT)‘kTom, (uM)? 


E f dan f dasleradss (V- Vo) | 
Vo<E 


- {exp (— V/kT)—[1+(E-Vo)/RT] 
-exp [—(E+V—Vo)/kT ]}. 


For (5), we have to differentiate this with respect 
to E. Differentiating with respect to the limit of 
integration involves setting Vo=E in the inte- 
grand, which vanishes hereupon. Differentiating 
inside with respect to E, setting E=0 and 


(10) 


dividing by Z/W#=v(2rkT)'(mımım;)? gives 
(mmm = mMm uM) 
k= (2rm,kT)- Í dais f das 
Vo<0 
- |gradıs( V— Vo) |(— Vo/RT je- Y-Y +T, (11) 


If the associating atoms are different and there 
are nı and n: of them in the volume v, and 2; of 
the third body, the number of effective collisions 
will be ninan; times (11), i.e., the rate constant is 
given by (11). If the associating atoms are 
identical and their number 2, the number of 
effective collisions will be (n(n—1)/2)ns times 


(11). Thus in case of the formation of a sym- 


metrical molecule the well-known factor 3 must 


be applied to (11). In addition to this, in most 


cases the reaction can occur only for definite 
relative orientations of the angular momenta of 
the atoms and this must be taken care of by 
another factor giving the probability of such an 
orientation. It may be pointed out again that (11) 
is only an upper limit for the rate of association 
and one could probably somewhat lower it by 
choosing H, more appropriately. 
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Equation (11) gives always an upper limit for 
the rate of association. However, if V— Vo can 
assume negative values, this upper limit can be 
lowered somewhat quite easily. Indeed, in a 
system formed by three atoms, the total relative 
energy H,=H—P?/2M never can be negative 


Fic. 1. 


and it is not necessary, therefore, to include 
systems with negative energy into the assembly. 

For large distance of the third body, naturally 
H,>H,. However, at ps*/2ut+V—V,)=0 the 
H,=0 surface cuts the A,=0 surface and goes 
above it. It is a better choice for S, then, to use 
the H,=0 surface where it lies above the H)=0 
surface, than this one, because no system can go 
through the H, =0 surface, H, being (apart from 
the influence of the walls) an integral of the 
motion. 

The integration of (9) over ps should not be 
extended, therefore, over the region p;?/2u+ V 
— V)<0 in (9) and a factor 


f exp (— ps?/2ukT)dps 
p3?>24(Vo—-V) 
/ f exp (— ps’/2ukT )dp3= 
0 
1— F((Vo— V)#/(RT)}) | 
+2((Vo— V)/rkT)}er'YoWwiet (12) 


can be inserted to the integrand of (11) wherever 
Vo— V is positive. F is the error integral 


2 z 
Foa- f edx. (13) 
a 0 


Another improvement can be made on (11) if 
the attractive region of Vo is preceded by a 
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repulsive part. However, since it is a rather rare 
occurrence, it will not be discussed further. 


6 


If the third body is an inert gas, the total 
potential energy is the sum of the interactions of 
the three pairs of atoms. V-Vo=Vıs(ri;) 
+ Vos(res) is positive in these cases and the 
correction (12) cannot be applied. The integrand 
of (11), as function of gs has a rather sharp 
maximum on two spherical portions, represented 
in Fig. 1. The radius 713 of the sphere around A, 
is the sum aıs of the collision radii of Aı and A;, 
similarly 723 is the sum as; of the collision radii of 
A, and As. On the first sphere, one can introduce 
instead of gs polar coordinates around Aı. 
Integration over the polar angles gives the solid 
angle ai; of the first spherical portion. This is 


oi3(qiz) = mL (Gist giz)? —Ge3? |/aisqie (14a) 
for diz <dig Ans and 
alqi) = 4r (14b) 


for gi2>@13+d@23. On the sphere around Aı, 
one can set V— Vo= Vi3(ris), and the deriva- 
tive of this, with respect to qi2, at constant 
q3, is ma/(mı-+me) times the derivative with 
respect to 713. Taking the factor a1s’a13(q12) 
out of the integral, one must integrate 
18Vis/Oris| exp (— Vis/RT) with respect to 713 
between 0 and œ, which gives just kT. One 
obtains, performing a similar transformation on 
the sphere around A; also, 


(2rm,kT)! 
eal 


Mi+ Me 
Vo<o 


$ Volmaaıs?aıs(q 12) +1 1023"a23(g 12)] 


2rm,\ è? i 
=-2( T \f Vogi: { (a13? — a23?) 


Vo<o 


- (a1s/mı—a23/mz) +2(a1s?/mı+a2s?/ma)gı2 
+ (a1s/mı+a23/m2)gı2? }dgıe. 


In the last expression, (14a) has been used 
throughout because in most practical cases Vo is 
very small for gaz >di3-+d3. 

This equation will be compared with the 
experimental results of Rabinowitch and W. C. 


(15) 
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Wood? on the recombination of iodine atoms. 


For V the Morse curve was used 
Vo= D(e72 20-70) — 2eretr-r0) (16) 


with! D=2.46-10-" erg, a=1.83-108 
ro = 2.66-10-8 cm. This gives 


s S Vogis?dqie = 2.7.1035 erg cm? 


cm, 


and 
— S Vogidgi2a=0.91-107% erg cm‘. 


In this case, d13=de3=rz;+rs and the values 
shown in Table I were adopted for the radii.” 
The last line gives the values calculated by (14). 
These must be divided first by 2, because the 
associating molecules are identical, and next by 
16, because only one of the ten states formed by 
two ?P;; iodine atoms leads to the normal state 
of the molecule. The experimental results are 
compared with the calculated ones in Table II. 

Although the disagreement is not very great, 
it is important to notice that the calculated 
values are lower throughout than the experi- 
mental ones. For the molecules He, Ne, Oz this is, 
perhaps, not so serious, because (15) should hold 
for atoms only.!? For He and A, however, (15) 
should definitively give an upper limit for the 
reaction rate. The calculated values should be 
indeed materially higher than the observed ones. 

One possible source of error is contained in the 
radii of Table I. It would seem, however, that 
these hardly can be in error by a sufficient amount 
to explain the discrepancy. The second source of 
error lies in the potential curve (16). It cannot be 
expected, of course, that the Morse curve be 
accurate throughout. In fact, if one determines 
the a of (16) from the anharmonicity of the 
vibration instead of dissociation energy and 
vibrational frequency," one obtains 1.47-10° 


®E. Rabinowitch and W. C. Wood. J. Chem. Phys. 4, 
497 (1936). 

10 Cf, for the spectroscopic data, H. Sponer, Molekül- 
spektren (Berlin, 1936), Vol. I, p. 18. 

u Cf. Landolt-Börnstein’s Tables, ist Ergänzungsband, 
p. 69. The radius of Xe, i.e., 1.75A, was used for J. 

12 In addition to this, the formation of an intermediate 
iodide of the third body may be considered. Cf. reference 9. 

13 For a discussion of this and related questions, cf. 
reference 10, Vol. 2, p. 103 f. 


725 


cm!. This would compensate the discrepancy. 
However, the curve of Brown" for Vo integrated 
directly, gives practically the same result as (16). 

It is possible also that not only a O,* state is 
strongly attractive among the 10 different states 


TABLE I. 
He A He Ne Oz 
r-108 1 1.43 1.09 1.58 1.48 
k-10% 4.5 6.5 5.4 7 6.7 
TABLE II 
He H: N: Oz 
calc. k-10% | 0.14 0.20 0.17 0.22 0.21 
exp ‘' “ 0.18 0.36 0.40 0.66 1.05 


arising from two ?P3;: iodine atoms. Although all 
states known so far are either repulsive or only 
very weakly attractive, this possibility must be 
left open. No such difficulty would come up at 
the calculation of the rate of recombination of H 
atoms, e.g., since all states are known in that 
case. 

There is one more point which should be 
mentioned. For very large gi2, the Morse curve is 
not valid and the attraction due to the magnetic 
moment of the spins decreases only as the inverse 
third power of gı2. Since the square bracket of 
(15) becomes 8rmzaıs? for very large qıs; the 
integral appears to diverge logarithmically. It 
does not appear sensible, however, to consider 
two atoms as forming a molecule if, in all 
probability, they will suffer a dissociating colli- 
sion before completing a vibration. One is thus 
led to extending the integration over qıs to the 
mean free path only. The contribution of the 
magnetic dipole forces then becomes negligible 
and the breaking down of (15) (caused by the 
incorrect situation of S) does not become serious. 


u W., G. Brown, Phys. Rev. 38, 1187 (1931). 
16 Cf, reference 14, also J. H. Van Vleck, Phys. Rev. 40, 
544 (1932). 
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1. 


According to our present notions, the theory of reaction rates 
involves three steps. First, one should know the behaviour of all 
molecules present in the system during the reaction, how they will move, 
and which products they will yield when colliding with definite velocities, 
etc. Practically, this amounts in most cases to the construction of the 
energy surface for the reacting system. Professor Eyring told us about 
the results which can be obtained by the application of quantum 
mechanics to molecular systems for this part of the theory. The second 
step in the theory I would call the statistical part. It endeavours to 
solve the problem of the rate of elementary reactions. Assuming only 
the material on the left side of a chemical equation to be present in a 
vessel, and the molecules of these to have the Maxwell-Boltzman energy 
and velocity distribution, one wants to know how many molecules corres- 
ponding to the right side of the equation will be formed in unit time. 
The elementary properties of the molecules are supposed to be known in 
this second step and one wants to express the reaction rate of elementary 
reactions in terms of these. The present paper will be devoted entirely 
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to this second step. The third step is the consideration of the co-opera- 
tion of the various elementary reactions, which may occur beside and 
must occur after each other in order to complete a real reaction. In 
especially favourable cases there is only one important chain of reactions 
leading to the final products and this has one link which is so much slower 
than all the others, that it is made responsible for the observed rate. 
The others are then assumed to be so much faster that one has practically 
equilibrium between the two sides of their chemical equätions. 

The fact that one has to subdivide the calculation of the reaction 
rates into three different steps shows that it has not acquired the neatness 
and finality of the theory of equilibria. I should like, on this occasion, 
to point again to one “ elementary reaction,” which, though often men- 
tioned, never has been properly taken into account. It is the production 
of fast molecules, the establishing of the Maxwell-Boltzman velocity and 
energy distribution. This was tacitly assumed to be a fast reaction and 
in complete equilibrium, before I began the discussion of the second step. 
In many cases this will not be true. 

Naturally, the treatment of the statistical step will be different for 
the different kinds of elementary reactions. It seems that one can 
divide the elementary reactions into three groups. I would class into 
the first group those which do not change either the electronic quantum 
state of the colliding particles nor their chemical formula and effect an 
interchange of kinetic, vibrational and rotational energy only. The 
consideration of these is, as I just pointed out, a rather neglected dis- 
cipline in chemistry.! The physicist is strongly interested in them for 
the so-called problems of mean free path (viscosity, heat conduction, etc.). 
Into the second group, one could class all reactions which involve no 
jump in the electronic quantum numbers but change the chemical con- 
stitution. These are the most common types of chemical reactions * and 
the transition state method applies to these alone. The remaining third 
class ® deals with reactions which involve a jump in the electronic struc- 
ture. This type of reactions has been dealt with in several papers of 
Professor Polanyi; it is clearly the most general type and probably the 
most difficult of all. 


2. 


The rest of this paper will be devoted to the consideration of the 
second kind of reactions. I shall endeavour rather to emphasise the 
basic assumptions of the theory than to derive ready formulas. 
Especially on account of some discrepancies with experiment, I think 
that it may be useful to see that the transition state method is based, in 
addition to well-established principles of statistical mechanics, on only 
three assumptions, two of which are generally accepted. 

The first assumption is that the comparatively slow motion of the 
nuclei is followed by the rapid motion of the electrons to such an extent 


1C. Zener, Physic. Rev., 1931, 37, 556, and O. K. Rice, J. Am. Chem. Soc., 
1932, 54, 4558 have studied the excitation of vibrations and the changing of 
rotational energy through collisions. Cf. also J. M. Jackson and N. F. Mott, 
Proc. Roy. Soc., A, 1932, 137, 703- 

2 According to quantum mechanics, chemical reactions can occur without 
a jump in the quantum state of the electrons. This fact was first recognised 
by F. London, Sommerfeld Festschrift, S. Hirzel, 1928, p. 104. , 

2 Quantum mechanics was first applied to these by F. London, Z. Physik, 
1932, 74, 143, and by L. Landau, Physik. Z. Sowjetunion, 1932, 1, 88 and 1932, 
2,146. Cf.alsoO.K. Rice, J. Amer. Chem. Soc., 1933, 1, 375. 
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that they are, for every position of the nuclei, in the lowest quantum 
state ? (adiabatic assumption). The energy of this lowest state, to- 
gether with the electrostatic energy of the nuclei, will be denoted by 
EX, Xa - - -» Xgn) where X,, Xa - - ., Xs, are the co-ordinates of 
the nuclei. Under the adiabatic assumption the nuclei will move as if 
they were acted upon only by the potential E(X}, Xa. - ., Xgn). 

The second assumption is that the motion of the nuclei under this 
potential can be described sufficiently accurately by classical mechanics 
(or, if this is not the case, the usual way of taking the quantum effects 
into account is correct). 

The last assumption can be best given at the end of the description 
of the transition state method, which will be the subject of the next 
section. 


3. 


For the calculation of the reaction rate it is sufficient to consider a 
system with as many atoms as occur in the chemical equation of the 
elementary reaction, since for a system with many atoms, the reaction 
rate will be simply proportional to the concentrations. Following Gibbs’ 


Fig. 1.—The regions (i) and (r), the 
activation point A and the surface 
F, in the section of the configura- 
tion space, corresponding to a linear 
configuration of the atoms H, H, Br. 
The ordinate is the distance of the 
two H atoms in A, the abscissa the 
distance of the middle H from the 
Br. The numbered lines are con- 
tour lines of the energy surface, 
constructed by Eyring and Polanyi 
for the reaction 


H + HBr = H, + Br. 


procedure, one will consider very many such systems, forming a macro- 
canonic ensemble. The configuration space of the system will have 3n 
co-ordinates and one can plot the function £,(X,, Xa... Xan) along 
a (3n + 1)th direction. Examples of such energy surfaces, for two- 
dimensional sections of the configuration space, are given in Figs. 1, 2, 
and 3. (The legends of Figs. 2 and 3 refer to questions to be brought 
up in section 5). 

The configuration of every system of the macrocanonic ensemble 
will be represented by a point in the configuration space. At ordinary 
temperature the points of most systems will be in regions of low energy. 
One such low region will correspond to the left side of the chemical 
equation, another low region to the right side of the equation. In the 
first region (2), the co-ordinate differences are small for those atoms which 
are in the same molecule on the left side of the equation. In the second 
region (r), the co-ordinate differences are small for those atoms which 
form a molecule on the right side of the chemical equation. The transi- 
tion between the two regions forms the reaction. 

We assume now that the two low regions are separated by a higher 
region, the height of which is at least great compared with the ordinary 
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thermal energy. Then, all paths leading from the first valley into the 
second will have a highest point. This highest point will be different for 
different paths. The paths for which the highest point is as low as 
possible are said to pass through an activation point A, which indeed is 
the highest point of these paths. The situation is better illustrated in 
the figures than can be done in words (Figs. 1, 2, and 3). 


b 
Fig. 2.— Reaction 
with an intermedi- 
ate region (i). At 


not too low tempera- 

tures, most points 

coming from (r) and 

entering (i) will re- 

\ turn to (r), rather 
\ | than to go through 
fi F the narrow channel 

A’ to (i). In the 

A reverse reaction, the 
j points coming from 
() and passing A’, 

will soon go over to 

(r), so that the con- 

w centration of points 
in (#) will not cor- 
respond to equili- 


at 
à brium. 


Most reactions will have not only one activation point, but a whole 
super-surface A of activation points (activation surface). Indeed, if 
we displace or rotate a system in the activated state, the state thus 
obtained will still be an activated state. All activation points have the 
same energy Q, and no system can react which has less energy than 
this amount. 


Fig. 3.— The sec- 
tion of the acti- 
vation surface 
with the plane 
of the Figure is 
the contour line 
6, from A to its 
forking point. 
A path shows a 
point crossing 
the activation 
surface and re- 
turning to (i) 
again. 


If we move on the activation surface A, the energy E, will remain Qo. 
If we move in the direction from which the path defining the activated 
state comes, or if we move in the direction of this path, the energy will 
decrease, because the activation point is the highest one of this path. 
In all directions perpendicular to the direction of steepest descent, how- 
ever, E, will increase. Otherwise, the highest point of an adjoining 
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path would be lower than Q,. The activation surface A forms a saddle- 
region; it is a maximum in one direction (in the direction of the path) 
but it is a minimum in all other directions. 

We can now draw a 3n — I dimensional surface F through the acti- 
vated points, perpendicular always to the direction of steepest descent. 
A will have, in general, less than 3m — I dimensions and lie entirely in 
F (cf. Fig. 1). Fora reaction in a system to occur, it is necessary that the 
representative point shall cross this 3n — ı dimensional surface. Most 
crossings will occur, of course, in the neighbourhood of the activation 
points, at any rate at such points the energy of which is not many kT 
greater than the activation energy Qo. Itis easy to calculate the number 
of systems of our assembly which cross F in one direction in unit time. 
It is simply equal to the density of points on the surface multiplied by 
the mean velocity perpendicular to the surface, and this integrated all 
over the surface : 


n = exp. (— E,/kT)ðpdo . . 0.) 
F 


For the practical evaluation of this integral one can expand E, into 
a power series of the co-ordinates which represent the distance from the 
activation surface A: 
3n — s —1 


EB=%+ D> Ulis- n EME. . . (2) 


i-l 


where &,, . . ., & are s co-ordinates within A and n,, . - ., Nsn-s-1 are the 
other 37n — s — 1 co-ordinates within F. Alla,l&, . . . &,) are neces- 
sarily positive and (1) can be evaluated, if (2) is sufficiently accurate, by 
elementary methods. 

We can formulate now the third assumption for the validity of the 
formulas of the transition state method. If all paths crossing the surface 
F in one direction originate in the low region / and, crossing F only once, 
end in the low region r, the reaction rate is simply given by the ratio of 
(1) to the number of systems in the region J, which is 


| exp. (— BykT)dX,..-dXs, . 0.00. 
i 


Here again, a development of the form (2) is possible in most cases and 
this allows a simple evaluation of (3). 

An especially simple way to evaluate the ratio of (I) and (3), i.e. the 
rate constant, has been developed by Eyring and by Evans and Polanyi.‘ 
Their considerations can be summarized as follows. Clearly, the ratio 
of (1) and (3) will be the same for all systems in which the potential in 
(2) and F is the same, while it may be different in the intermediate region. 
One can replace, therefore, the real potential which is decreasing in the 
direction perdendicular to F, by a potential which increases in this 
direction. The region of F will represent then a metastable molecule, 
the “ activated molecule,” the probability C of which can be calculated 
by the well-known formulas for chemical equilibria. The activated 
molecule will have a vibrational frequency v corresponding to the motion 
perpendicular to F, so that C will contain a factor kT fv. 


1 M. C. Evans and M. Polanyi, Trans. Faraday Soc., 1935, 31,875; H. Eyring, 
J. Chem. Physics, 1935, 3, 107. 
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The activated molecule will pass through F in one direction v times 
per second, so that the rate constant becomes 


k=Cy . š 5 Pui . (4) 


This, of course, in reality does not depend on v, #.e., on the steepness of 
the increase of the artificial potential perpendicular to F. Equation (4) 
is very practical for actual calculations. It is, however, quite easy to 
evaluate (1) and (3) as they stand also.® 

Most formule for C take quantum effects into account also. Strictly 
speaking, it should be remembered that it is only in classical statistics 
permissible to make the above change in the potential. 

Before discussing the theoretical foundations of the theory, I should 
like to quote a few characteristic instances in which it was possible to 
compare the theory with experiment. 

The difficulty of this comparison lies mainly in the difficulty of ob- 
taining accurate energy surfaces. I think, therefore, that at the present 
stage of the theory one must derive the most important quantity in 
(4), the activation heat Q,, from the experiments themselves. The fre- 
quencies of the activated state can be derived from the calculated surface 
with less danger. If one determines these from the observed rate also, 
very few data may be left as an actual check of the theory. This pro- 
cedure can be used only for reactions for which rich experimental material 
is available, especially if the isotopic effect is known. 

The experimental difficulties in measuring reaction rates are only too 
well known. Since at least one quantity for (4), namely Q,, is derived 
from the experiments, the values of the rate at at least two temperatures 
must be known. If these temperatures are too close, a relatively small 
error in the measurement of the rate may lead to an incorrect determina- 
tion of Q, and a very great inaccuracy in the remaining factors. 

The oldest example of the application of the transition state method 
is the reaction. H + H, = H+ H and the similar reactions with 
deuterium.* They have been measured partly directly, partly by the 
ortho-para-conversion.” The calculation of the corresponding energy 
surface ê is the simplest example of its kind. Although in my opinion, 
the agreement between experiment and theory is not complete, the dis- 
agreement amounts to less than 50 per cent. with the most suitably 
chosen set of constants. A similar example is Cl + H, = CIH + H, 
where the agreement with the measurement of Rodebush ® is within 7° 


‘In this way the transition state method was first applied to actual energy 
surfaces. Cf. H. Pelzer and E. Wigner, Z. physik. Chem., B, 1932, 18) 445. . The 
uantum corrections were included ibid., 1932, 19, 203. Formule similar to 
ose resulting from the transition state method ® were obtained not much later 
by W. H. Rodebush, J. Chem. Physics, 1933, 1, 440 and by O. K. Rice and H. 
Gershinowitz, ibid., 1934, 2, 853. However, these are very ingenious guesses 
rather than real derivations based on statistical mechanics. Consequently, the 
results are not quite identical with those of the transition state method. 

* Cf. H. Pelzer and E. Wigner, and E. Wigner 4- J, Hirschfelder; H. Eyring 
and B. Topley, J. Chem. Physics, 1936, 4,170; L. Farkas and E. Wigner, Trans. 
Faraday Soc., 1936, 32, I. 

7A, Farkas, Z. Electrochem., 1930, 36, 782; Z. physik. Chem., B, 1931, 10, 
419; K. H. Geib and P. Harteck, sbid., 1931 ; Bodensteinband, 849 ; A.Farkas 
and L. Farkas, Proc. Roy. Soc., A, 1935, 152, 124. | 

°H. shia and M. Polanyi, Z. en Chem., B, 1931, 12, 279; 
J. Hirschfelder, H. Eyring and N. Rosen, f hem. Physics, 1936, 4, 121. 

W, H. Rodebush and Klingellioeffer, J. Amer. Chem. Soc., 1933, 55, 130 ; 
M. Bodenstein, Trans. Faraday Soc., 1931, 27, 8, 413. 

10 A, Wheeler, B. Topley and H. Eyring, J. Chem. Physics, 1936, 4, 178. 
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the experimental error. The two temperatures at which the rate is 
measured are very close in this case (0° and 25°) and the limits of error very 
large. 
Two four-body reactions have been investigated: H, + I, = 2HI 
and H, + ICI = HI + HCl. The discrepancy in these cases is a factor 
of 25 and 300 respectively. The construction of potential surfaces is 
quite difficult in these cases. 

Perhaps the most interesting example is that of the reaction 
2NO + O, = 2NO, studied theoretically by H. Gershinowitz and 
H. Eyring.? The energy surface is not known for this reaction. It 
is clear, however, that in the activated state vibrational degrees of 
freedom take the place of several rotational degrees of freedom of the 
left side of the chemical equation. Thus the phase space available is 
strongly diminished in the activated state and the reaction rate unusually 
low. In spite of the absence of a detailed energy surface, Gershinowitz 
and Eyring could calculate the reaction rate with a surprising accuracy 
and also explain the negative temperature coefficient of this reaction. 

I have given only the simplest examples of the comparison of theory 
and experiment, omitting entirely the highly interesting work of Polanyi 
and Evans on liquids,1? of Bawn on “‘ slow ” reactions, and the extension 
of the transition state method to the ionogenic reactions by Ogg, Polanyi, 
and A. G. and M. G. Evans.!* This subject will be taken up probably by 
other speakers. 

On the whole, I feel that the agreements are not as good as could be 
expected. Doubtless the experiments are partly to blame for this, 
and the way of comparison I adopted here. On the other hand, the 
disagreements call for a review of the theoretical foundations of the 
transition state method. _ 

In the next two sections I will, therefore, discuss the validity of the 
three conditions heretofore stated. 


4. 


(a) Adiabatic Condition.—The electronic motion wili be able to 
follow the motion of the nuclei, if that latter motion is not too rapid and 
if the electronic wave function does not change too radically for a small 
change in the nuclear positions. The first of these is fulfilled practically 
always. In all cases in which a non-adiabatic reaction is assumed, the 
rapid change of the wave function in the neighbourhood of an approxi- 
mate crossing of energy levels is made responsible for it. I think that 
for an energy surface which does not show singularities in abnormal 
curvatures, etc., the adiabatic condition cannot cause serious dis- 


crepancies. 


n For H, + I, = 2HI cf., for H, + ICI = HCl + HI, Wm. Altar and 
H. Eyring, J. Chem. Physics, 1936, 4, 661. 

12 H, Gershinowitz and H. Eyring, J. Amer. Chem. Soc., 1935, 57, 985. Cf. 
also O. K. Rice, J. Chem. Physics, 1936, 4, 53. The measurements are due to 
M. Bodenstein, Z. Electrochem., 1918, 24, 183; M. Bodenstein and Lindner, 
Z. physik. Chem., 1922, 100, 87; M. Bodenstein and Ramstätter, ibid., 106; 
Briner, Pfeiffer and Malet, J. chim. physique, 1924, 21, 25. 

13 M. G. Evans and M. Polanyi, Trans. Faraday Soc.,.1935, 31, 875; M. G. 
Evans and M. Polanyi, tbid., 1936, 32, 1333; W. F. K. Wynne-Jones and H. 
Eyring, J. Chem. Physics, 1935, 3,492. For the theory of slow reactions C. E. H. 
Bawn, Trans. Faraday Soc.,:1935, 31, I. 

“R. A. Ogg and M. Polanyi, Trans. Faraday Soc., 1935, 31, 1375; A. G. 
Evans and M. G. Evans, ibid., 1400. Cf. also C. F. Goodeve, sbid., 1934, 30, 60. 
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(b) Quantum Effects.—In the argument leading, to (4) it was neces- 
sary to assume the validity of classical mechanics only tor the activated 
state. In most cases, however, some vibrations of the activated molecule 
will not be in the classical region. Even this will not matter for those 
vibrations which either are too far from the bond to be changed, or, for 
some other reason, do not interfere with the motion of the important 
degree of freedom. It appears indeed natural to use the usual quantum 
expressions for C in (4), even for those degrees of freedom which inter- 
act strongly with the critical bond. A reasonably rigorous argument 
shows 1% that this expression is (when modified by an additional factor 
to take into account tunnelling) valid up to the fourth power of h. This 
may render (4) probable, but we must admit that it does not prove it 
beyond the third power terms in Ah. 

Indeed, one can point out two cases in which (4) with the usual 
quantum expression for C breaks down badly. The first one is given 
in Fig. 4. The zero-point energy corresponding to the vibration along 
OX will appear in (4), not, however, in reality, since the potential barrier 
is very thin. 

_ In the second case, tunnelling under the high region of E, plays an 

important rôle. A negative curvature of E, in the activated state can 

be described as imaginary v. It may be in- 

Xy teresting to note that (4) still holds up to the 

iaa third power of h, if this imaginary frequency 

is used as one of the v of the activated state. 

Of course, the higher power terms of (4) are 

not correct in this case, partly, indeed, because 

= ===X the rate depends on the further continuation 
0 of the series of (2). 

The reason that it is so much more difh- 
cult to apply the transition state method in 
= quantum theory than it is in classical theory 

Fic. 4. is twofold. Firstly, there is no correspond- 
| ing simple expression to Boltzmann's 
exp.(— E,/kT) for the probability of a configuration. But an even 
greater difficulty is that one cannot speak about the mean velocity ọ 
at the activation point. (Heisenberg’s indetermination principle.) 
There is thus, strictly speaking, no way to define the activated complex 
and its lifetime. All that appears to be possible, in general, is to start 
out from the classical expression and develop the quantum corrections 
into a power series of h (only even powers will occur). This has béen 
done to the second power, but the incentive to continue this procedure 
is not very great, partly because of the laborious mathematical work, 
partly because three terms will not be very much better than two are, 
and finally and mainly because our knowledge of energy surfaces hardly 
warrants an elaborate work. 

One can form an idea of the magnitude of the corrections from the 
one dimensional case where, for several potentials, one has exact solutions 
of the Schrödinger equation which make a power series development — 
unnecessary. Bell1® has calculated the penetration of Eckart’s one 
dimensional barrier 17 (Fig. 5a) as function of the temperature. His 
results for the reaction rate, the classical values, for the same conditions 


18 E, Wigner, Z. physik. Chem., B, 1932, 19, 203. 
16 R. P. Bell, Proc. Roy. Soc., A, 1933, 139, 466. 
17 C. Eckart, Physic. Rev., 1930, 35, 1303. 
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and the values of A and Q derived from Bell's results in the usual way by 
representing them as Ae-@IRT are given in Table I. The values of A 
and Q become 1 and 14'5 Cal. for all temperatures in classical theory. 

~ One sees that Q and A both decrease with decreasing temperature. 
This behaviour can be seen qualitatively without calculation: at very 
low temperature, the tunnelling rate becomes independent of tempera- 
ture (which. means Q = 0) and very slow (i.e, A is very small). One 


TABLE I. 


2 . 10720 2'5 . 10-13 r077 3'5 
a3 i . 107° 1:5 . 10-1? 15.10 8-5 
373 3:1. 10-8 3 .107-° 3°10-3 Io 

473 7.3.1077 2 10? 015 11-5 
573 6-4 . 107° 2-9. 107° 0:30 12:5 
673 34.10 2 10 0°55 13:2 


is tempted to use this behaviour for the explanation of the very small 
A values, occasionally observed. It must be remembered, however, 
that even a potential curve like that of Bell gives an appreciable tunnelling 
for very light atoms only (Bell’s calculation is for hydrogen). 


a e 
` ld 


“2 S Pa 

l0 -ery x lo erg 

O TA o 14 
Fic. 5(a). Fıc. 5(b). 


A potential curve which may be realised more frequently, is given 
as a full line in Fig. 55. Such a shape would occur always if two energy 
surfaces nearly cross (the other energy surface is dotted in Fig. 56). 
A non-adiabatic reaction is possible in such cases, but we shall disregard 
this. For such a potential curve one may expect some tunnelling near 
the top even for somewhat heavier atoms. 

The penetration probability can be exactly calculated if one neglects 
the small rounding off of the top of the curve and makes it quite sharp- 
edged. For energies W which are not too near the classical activation 
energy Qo, the penetration probability is 18 


R = 4W*h(Q, — WYO? exp. (— al, — Wh), a = 4aV2M/3h (5) 
where M is the mass of the penetrating particle, a is the sum of the re- 
ciprocal slopes of the two sides of the potential curve. At the temperature 
T = 1/kB the rate of penetration, divided by the classical rate e”f@e/ß is, 
assuming (5) correct throughout, 


œ 
k 
pa = Reena- maw © -O 
0 


18 R. H. Fowler and L. Nordheim, Proc. Rev. Soc., A, 1928, 119, 173. 
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For 1 < B?/a® < BQ, this can be evaluated and gives 


k 32 mh is 
re VOL Vo, e À > : . (7) 


A plot of (7) against T shows that for an atomic weight 10 and the poten- 
tial curve of Fig. 5b, the quantum correction becomes insignificant above 
0° C. It seems, therefore, that apart from reactions involving H, the 
tunnelling effect cannot be made responsible at ordinary temperatures 
for any large decrease of the temperature independent factor. 

These considerations have a particular reference to the dissociation 
of N,O in which the temperature independent factor was found !? to 
be about 107? times the value obtained by estimating (4). This was 
taken, for awhile, as an indication of the non-adiabatic nature of the 
reaction,” which was suggested by the fact that it violates the spin con- 
servation law. However, it has been shown by Zener?! on the basis 
of the interaction integrals obtained from the intensities of forbidden 


Fic. 6. 


lines, that the non-adiabatic character of the reaction cannot be made 
responsible for a factor of the order of 10-5. We saw that the same 
holds of the tunnelling effect, so that it must be admitted that the ex- 
perimental finding is unexplained at present. Perhaps some explanation 
can be obtained from a more detailed level scheme, like the one drawn 
schematically and very tentatively, in Fig. 6. If the lower level arising 
from the 3P term of oxygen is a 3II, level, this will strictly cross the 
12, level of the ground state for a straight configuration of the N,O 
molecule. Then, the reaction will certainly not be adiabatic for a linear 


19 M. Volmer and H. Kummerow, Z. physik. Chem., B, 1930, 9, 141 ; Nagasako 
and Volmer, ibid., 1930, 10, 414 ; Hunter, Proc. Roy. Soc., A, 1934, 144, 386. 

20 H. Eyring, Chem. Rev., 1932, 10, 103; H. Pelzer and E. Wigner,’ 
G. Herzberg, Z. physik. Chem., B, 1932, 17, 68; A. E. Stearn and H. Eyring, 
J. Chem. Physics, 1935, 3, 778. 

21 C. Zener, private communication. The spectroscopic data were interpreted 
by A. F. Stevenson, Proc. Roy. Soc., A, 1932, 137, 298; E. U. Condon, Astrophys. 
J.» 1934, 79, 217. 
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dissociation—not on account of the spin conservation law but because 
of the conservation law for angular momentum around the molecular 
axis. The reaction still can go adiabatically for a bent molecule, but 
it seems that the energy surface becomes so complicated that a larger 
effect of the tunnelling cannot be a priori excluded. 

This example again emphasises how every calculation of reaction 
rates is bound to founder unless we have.more detailed energy surfaces 
than are available at present for N,O. One is also led to believe that 
the energy surfaces show more often a complicated structure, as exempli- 
fied in Fig. 6, than has been assumed hitherto. If this is true, the de- 
viations from (4) become readily understandable, not only on account of 
the uncertainty in the quantum theoretical expression for C, but also 
because the last condition for the validity of (4) will be strongly violated. 


5 


The last assumption of the transition state method was that all 
systems crossing the potential barrier are reacting systems. This 
appears to be quite natural for a simple energy surface of the kind 
illustrated in Fig. 1. Figs. 2 and 3 show, however, that especially for 
more complicated potential surfaces the system is even most likely to 
cross the activated state and to return then to the original configuration. 

It appears thus that the formula (4) will lead, in general, to too high 
values of the reaction rate and should be corrected by a factor y, smaller 
than one, which is the ratio of the number of the crossings through 
F which end in r, coming from / (or end in /, coming from r), to the number 
of all the crossings. The problem is then to calculate y. 

There is at least one case in which y is certainly 1. If the dimen- 
sionality s of the activation surface A is less than 3n — 1 and if the tem- 
perature is very low, then most of the crossings through F will go through 
a very narrow strip of F around A. Even if the system does not go 
straight through into the final state, after having crossed F, but is re- 
flected as in 71% Fig. 2, the probability that it will find that very narrow 
strip through which it can return into the region from where it started, 
is very small. It will be the smaller, the thinner the strip is, i.e. the 
less the energy of the system surpasses Q, Since the mean energy 
excess of the molecules, which have an energy larger than Q,, over 
this energy Q, is proportional to 7, the probability of a return will 
go to zero with decreasing temperature (unless quantum effects should 
interfere), and y goes to 1. However, at higher temperatures y will 
decrease, and, apart from special cases (like that of an intermediate 
valley), it may be quite difficult #2 to find its accurate value. The com- 
plication indicated in Fig. 2 is not quite an adequate picture of the 
situation, since in the 3” dimensional configuration space, the picture 
may be even more involved. 


310 On account of a mistake in drawing Fig. 2, the original substances corre- 
spond to r in this case and the reaction products to}. (In all other examples it 
is the opposite way.) 

33 Mathematically, the problem is the same as that attacked by G. Lemaitre 
and M. S. Vallarta (Physic. Rev., 1933, 43, 87) and by G. Lemaitre, M. S. 
Vallarta and L. P. Bouckaert (ibid., 1935, 47, 434) for the calculation of the 
nn con of cosmic radiation. A complete solution involves in every case very 
much labour. 
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Since one cannot calculate Q, at present sufficiently accurately, the 
comparison of (4) with experiment requires the experimental determina- 
tion of the rate constant at at least two temperatures. This procedure 
tends to multiply the experimental errors and this may be the cause of 
a good part of the discrepancies mentioned in section 3. This difficulty 
would not arise in reactions without a heat of activation, like 


A+B+C=AB+C ran (:) 


However, the transition state method cannot be applied directly to 
this type. 

Eyring, Gershinowitz and Sun * calculated the reaction rate of (8) 
if A, B, C are atoms by a very ingenious method for the case that the 
energy surface consists of two parts, both of which are developable. 
This is the case to a large extent for the H + H + H = H, + H reaction. 
I should like to show how to calculate the rate constant without using 
this assumption, but using a symbolic application of the transition 
state method, as first considered 
by Polanyi.?4 

The energy surface of two at- 
tracting atoms shows clearly that 
one cannot define an activation sur- 
face in the usual way. If one puts 
it at a short distance, the atoms 
which cross it will not necessarily 
dissociate, if one puts it a large 
distance, most of the atoms crossing 
it are coming from the dissociated 
state to begin with (Fig. 7). One 
obtains the impression that it is not 
the passing through a surface in 
space which is important, but the 

Fic. 7. passing of the relative energy of the 

two atoms (kinetic and potential) 

through the value 0. One is thus led to consider the energy (or something 
equivalent) as a co-ordinate instead of the usual space co-ordinates. _ 

Following this line of thought I derived the formula for the velocity 
constant 


k= re (— VeA — Ve) gr, r,? sin edr, dr, de 
r 1 
g = [07/09 + (007 — Vo) lèr h. 

Here B = 1/kT; the relative mass of the two atoms A and B is m, ; 
the potential energy of these at the distance 7, is Vy if C is very distant. 
If C is near, the potential energy of all three is V(r,r,¢), where 7, is the 
distance from A to B, the distance of their centre of mass C’ from C is r, 
and e is the angle between the lines C’C and AC’. The integration has 
to be extended over all the region in which V, is negative. The details 
of the derivation of (9) will be given elsewhere.*te V — Vis assumed to 


3 H. Eyring, H. Gershinowitz and C. E. Sun, J. Chem. Physics, 1935, 3, 
786; H. Gershinowitz, ibid., 1936, 4, 363. 

24M. Polanyi and E. Wigner, Z. physik. Chem., 1928 ; Haber Band, 439. 

240 J, Chem. Physics, 1937, 5, September issue. 


(9) 
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be positive throughout ; (9) will be slightly modified otherwise. A factor 
1/2 enters if A and B are identical atoms. 

The formula (9) for the association reaction is much more complicated 
than that for exchange reactions, e.g., the reason for this is that the latter 
could be simplified by an assumption like (2), since only the neighbour- 
hood of the activation surface was important for these reactions. This is 
not true for the type (8). 

Formula (9) tells us how often the relative energy of two atoms A and 
B decreases in unit time below o, if there is a third atom C together with 
them in a box, assuming complete equilibrium. Again, this will be 
only an upper limit for the reaction rate, because the relative energy of 
A and B may increase above zero immediately after it has fallen below, 
thus necessitating the introduction of a factor y similar to that for ex- 
change reactions. It may be again expected, however, that y will be 
of the order ı if the interaction between C and the A-B attraction is not 
very great. There is one important difference between this and the 
y of Fig. 2 : they for (9) will not go to I with decreasing temperature, and 
will be in general further from 1 than that for (4). 

Equation (9) shows the same dependence on the mass of the reacting 
particles as that of Eyring, Gershinowitz and Sun. This is in agreement 
with measurements on the association of hydrogen atoms.?5 It also shows 
that the “third body ” C will be more efficient which exerts greater 
forces on A and B. For equal chemical properties, the efficiency will 
be roughly proportional to the surface of C. Even so far as numerical 
values are concerned, the disagreement between (9) and experiment 
is not very great, for the 2I +A=I,+A reaction, (9) yields 
k = 20x 107%? cm.®/sec., the experimental value of E. Rabinowitch 
and W. C. Wood * being 3-6 x 10733. The values with He as the third 
body, instead of A, are I-4 X 10% and I-8 X 10733, respectively. 
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University of Wisconsin, 
Madison, Wisconsin, U.S.A. 


38 I. Amdur, J. Amer. Chem. Soc., 1935, 57, 856; W. Steiner, Trans. Faraday 
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The activated complex or transition state method for 
calculating the absolute rate of a chemical reaction with an 
activation energy would be rigorously valid if classical 
mechanics applied to all degrees of freedom. In quantum 
mechanics, two kinds of limitations must be considered. 
First, because of Heisenberg’s uncertainty principle, the 
transition state itself can be defined only if the potential 
surface is sufficiently flat around the highest point of the 
reaction path. Second, even if this condition is fulfilled, 
the transmission coefficient can differ from the value 
expected on the basis of classical mechanics, because a 
wave packet can be reflected both on its way up, and also 
on its way down the potential barrier separating the 
initial and final states. In fact, the transmission coeffi- 
cient is, in many cases, a rapidly fluctuating function 
of the energy of the system. If the temperature distribution 
of the energy is sufficiently broad to cover several pcriods 


I. THE TRANSITION STATE METHOD 
AND ITS VALIDITY 


T is well known that whenever classical me- 
chanics is valid, the formula! 


k= (P./P;)(5/0)-!y = Pidy/ (P38) (1) 


for the rate, k, of reactions with an activation 
energy is a direct consequence of statistical 
mechanics. In (1), P. is the probability of the 
transition state in thermal equilibrium. The 
transition state is a strip of width 6 in configura- 
tion space. This strip lies across the deepest 
saddle on the energy mountain separating the 
two regions in configuration space which corre- 
spond, respectively, to the initial and final state 
of the reaction. P; is the probability of the system 
being in the initial state; ois the average velocity 
with which the configuration points cross the 
saddle; 5/# their average time of sojourn in the 
transition state. Finally, y expresses the proba- 


* Now at Princeton University. 

1 This equation was first proposed by A. Marcelin, Ann. 
chim. phys. (9)3, 120, 185 (1915). Subsequent treatments 
of this nature have been given by A. March, Physik. 
Zeits. 18, 53 (1917); Pelzer and Wigner, Zeits. f. physik. 
Chemie B15, 445 (1932); E. Wigner, Zeits. f. physik. 
Chemie B19, 203 (1932); H. Eyring, J. Chem. Phys. 3, 107 
Hoses Evans and Polanyi, Trans. Faraday Soc. 31, 875 

1 > 


of this fluctuation, an average tı2"smission coefficient 
can be defined which nearly agrees with the classical value. 
For the crossing of a one-dimensional potential barrier, 
the quantum corrections are surprisingly small. In problems 
with several degrees of freedom, the transmission coefficient 
is affected by the interchange of translational and vibra- 
tional energy. However, if the vibrational motion is fast 
as compared with the motion along the reaction path, these 
degrees of freedom can be treated on a par with the elec- 
tronic coordinates. In this case, the formulas of Eyring, 
with a mechanically sensible transmission coefficient, are 
satisfactory. On the whole, we conclude that quantum- 
mechanical considerations invalidate the transition state 
method to a much smaller extent than could be presumed 
and it is only in the consideration of the relative rates of 
reactions between isotopes and reactions at very low tem- 
peratures that these effects may be important. 


bility that a system which crosses the saddle at 
complete thermal equilibrium actually originated 
in the initial state and will proceed to the final 
state to complete the chemical reaction.? The 
transmission coefficient, y, is the only quantity 
appearing in Eq. (1) which cannot be evaluated 
by well-known methods of statistical mechanics 
—it can be estimated only from the general 
shape of the potential mountain. 

There is no reaction with an activation energy 
in which classical mechanics is valid for all 
parts of the reacting system. However, it appears 
logical’ to replace the classical expressions for 
P,and P; in (1) by the corresponding quantum 
theoretical sums. The question as to what extent 
this is justifiable has been discussed repeatedly.‘ 
One difficulty is that the notion of an activated 


2 A more detailed explanation of (1) will be found, e.g. 
Trans. Faraday Soc. 34, 29 (1938). The reaction is con- 
sidered ‘‘completed” when the distance between the reac- 
tion products becomes large compared with molecular 
dimensions. This has a clear sense only in gas reactions 
where the products separate at once to very large dis- 
tances (of the order of the mean free path) if they begin 
to separate at all. 

3 This was first done by E. Wigner (Zeits. f. physik. 
Chemie B19, 203 (1932). It was also the method adopted 
by H. Eyring in developing his formulae of great gen- 
erality. 

“See for example, E. Wigner, Trans. Faraday Soc. 34 
29 (1938). 
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or transition state is not strictly compatible with 
the laws of quantum mechanics. The transition 
state method can only be justified when the path 
of the reaction is sufficiently flat in the neighbor- 
. hood of the transition state so that we can con- 
sider simultaneously the position and the ve- 
locity of the point representing the system in 
configuration space. In classical mechanics, 6 is 
always considered to be so small that the poten- 
. tial is practically constant across the transition 
state. If 6 is large, P, and the average length of 
time that the system spends in the transition 
state become rather complicated functions of ô 
and (1) no longer applies. In quantum mechanics, 
it is necessary to take the width of the transition 
state sufficiently great to allow reducing the 
uncertainty in the velocity, Av, to a small fraction 
of the average thermal velocity, ð. Certainly, if we 
are to apply Maxwell’s formula, exp (— mv?/2kT), 
for the probability of a system with a mass m 
to have the velocity v at the temperature T, the 
spread in the velocity, Av, due to the uncertainty 
principle must be so small that it makes little 
difference whether we use the upper bound, 
v+Av/2, or the lower bound, v—Av/2, in the 
exponential. This means that mvAv must be 
small compared with kT. Putting (kT /m)} for v, 
this gives Av<(kT/m)+. We must take the 
transition state sufficiently wide so that this last 
relation can be satisfied. On the other hand, this 
width must be sufficiently small so that it is still 
a good approximation to take P, proportional 
to 6. Thus, if the energy surface in the vicinity 
of the saddle has a curvature in the direction of 
the reaction path corresponding to the parabola, 
V= Vo—aX?, we must confine the activated 
state to a strip across the path which is so thin 
that exp (—aX?/kT) shall be nearly unity all 
over the strip. This means that 6<(kT/a)}. 
These limits for the maximum values of Av and 
of 6 are only compatible with the quantum- 
mechanical uncertainty principle if: 


h 
a <dmAv<Km(kT /a) (kT /m)}. 
T 


Thus the transition state can only be defined 
when the curvature of the energy surface satisfies 
the condition: 


(h/2r)(a/m)!<kT. (2) 
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Apart from an unimportant. numerical factor, the 
left side of this equation is the zero point energy 
of a mass, m, in a potential, +aX?. Thus, Eq. (1) 
and the transition states method can be justified 
only if this virtual zero point energy is smaller 
than kT. If Eq. (2) is satisfied and if the-de- 
pendence of the potential on the coordinates 
along the strip does not change appreciably from 
one side of the strip to the other, the transition 


state can be defined sufficiently accurately for 


deriving Eq. (1) in spite of the limitations of 
quantum mechanics. Fortunately, this is fulfilled 
in many important cases. 

The computation of y presents another diff- 
culty in the quantum-mechanical applications. 
As long as the reacting system behaves classi- 
cally, y can be estimated from simple geometrical 
and mechanical arguments. Usually the coordi- 
nates in configuration space are chosen so that 
the kinetic energy is the product of an effective 
mass and the sum of squares of the generalized 
velocities. In such coordinate systems, molecular 
collisions are kinematically equivalent to the 
rolling of a ball on a surface whose height is 
proportional to the potential energy of the corre- 
sponding molecular configuration. The question 
whether a given collision will result in a chemical 
reaction can be answered by examining the 
trajectory of the ball when it is shot in the corre- 
sponding manner.: However, when quantum 
theory is used, the ball must be replaced by a 
wave packet. In some reactions y will retain its 
classical value. In others, due primarily to the 
diffraction of the wave packets, it will have a 
different magnitude. Thus y can be much smaller 
than unity in quantum-mechanical systems even 
for an energy surface for which a mechanical 
picture does not indicate an appreciable proba- 
bility of reflection. Consequently, the use of 
Eq. (1) with seemingly reasonable values of y can 
lead to erroneous rates even when the condition 
of (2) is fulfilled. 

If the condition (2) is not fulfilled, (1) must 
certainly be corrected to take into account the 
quantum-mechanical penetration of the potential 
barrier.» But it may not be possible to use a 


SR. E. Langer; Born and Franck, C. Eckart, Phys. Rev. 
35, 1303 (1930); R. P. Bell, Proc. Roy. Soc. A139, 466 
(1933). That (2) is the condition for the absence of an 
appreciable amount of tunneling is evident also from the 
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penetration factor in any simple way to correct 
for the deviations from classical theory. If Eq. (2) 
is not fulfilled, a reflection after the crossing of 
the saddle (which is usually taken into account 
by choosing an appropriate value of y) cannot 
be distinguished from a failure to cross the 
saddle. Thus it would no longer be possible to 
define a transmission coefficient. 


II. THE TRANSMISSION COEFFICIENT 


A. We shall first carry out a rather formal con- 
sideration concerning transmission coefficients, 
which will be based on the assumption that there 
is a definite probability for a system, which has 
crossed the transition state, to be reflected 
back to the transition state. This probability will 
be denoted by p; and p; for crossings from left 
to right and right to left, respectively. It will 
be assumed that these probabilities are equal for 
systems originating from the two sides of the 
transition state and are independent of the 
number of times the system has already crossed 
the transition state. 

The value of the transmission coefficient, y, 
depends of course on the exact nature of the 
reaction surface. It is unity if all systems cross 
the transition state only once in their passage 
from the initial to the final state or from the 
final to the initial state. In the general case, 
some of the systems® which cross the transition 
state will cross it again in the other direction. 
This may happen repeatedly before the atoms of 
the system finally separate, either to form the 
same molecules which originally collided, or else 
to form the molecules corresponding to the com- 
pleted reaction. Thus some trajectories which 
cross the transition state from left to right will 
not lead to a chemical reaction and others may 
require many crossings before the reaction is 
completed. 

At complete thermal! equilibrium, half of the 
systems in the transition state are moving from 
left to right and the other half from right to left. 


formula for the first quantum correction to (1). (Cf. refer- 
ence 3.) 

6 The term “system” is used here also for the point in 
configuration space which corresponds to the position of 
the atoms forming the system. The left side of the transi- 
tion state is supposed to correspond to the atoms forming 
the molecules of the reacting substances, the right side 
corresponds to the reaction product. 
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Fic. 1. The upper drawing shows the reflection of the 
systems on the two sides of the transition state. This is 
similar to the reflection of a beam of light at the air-glass 
and glass-air interfaces in passing through the plate, glass 
shown in the lower figure. 


The transmission coefficient, y, is the fraction of 
the systems moving from left to right at complete 
thermal equilibrium which originally came from 
the initial state and which will proceed to the 
final state without first returning to the transi- 
tion state. At complete thermal equilibrium, A 
systems arrive, in unit time, at the transition 
state directly. from the initial state and B 
systems come from the final state. Fig. 1 shows 
schematically the flux of the different types of. 
systems through the transition region. By simple 
addition, we see that the total number of systems 
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crossing the transition state from left to right in 
unit time is 


Nar=All+oprtp2pr?t+:::) 
+BplltpiprtPPpf+**) 
=(A+Bp)(1- pips) (3) 


and the number of those crossing it from right 
to left is 


Nwi=Apitpipstp2ort+::-) 
+B(1+p:p;+ pfp Pt: -) 
= (A p; +B) (1 — pips). 


At equilibrium Ni,,=N,41, and hence 
B=A(1—p;)/(1—p)). (5) 
Substituting this expression for B into Eq. (3), 
Ninr=A/(1— pi). (6) 


However, the number of systems which have 
originated in the initial state and which proceed 
to the final state is seen to be 


Ning =A(1— plt pipt pprt.) 
=A(1—p;/)/(1— pips). (7) 


The transmission coefficient is the ratio of Ni, 
to Nor 


= Nis /N m= (1— 9:)(1— 97)/(1— pis). (8) 


The assumption made at the beginning of this 
section that the systems crossing the transition 
state from left to right have the same probability 
for being reflected no matter whether they 
originated in the initial or in the final state 
(or how many times they have already crossed 
the transition state), is not always justified. 
‘We shall see in Section 3 that for a one-dimen- 
sional barrier, the transmission coefficient for any 
one value of the energy does not necessarily 
satisfy Eq. (8). However, the average trans- 
mission coefficient for many systems with slightly 
different energies will agree with the above 
expression. 

We can compare the transmission of a wave 
packet through a potential barrier, as considered 
in this section, with the passage of a beam of 
light through a piece of plate glass partially 
silvered on both sides. (See lower part of Fig. 1.) 
Light of one particular frequency shows a very 
complicated diffraction pattern. If the direction 
of the beam is sharply defined, the amount of 


(4) 
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light passing through the glass depends on the 
exact wave-length. For light of a range of 
frequencies, the transmission no longer depends 
critically on the wave-length and will be given 
by (8) (provided that the thickness of the plate 
is large compared with the wave-length). From 
this analogy we see that Eq. (8) can apply for 
the average transmission coefficient of an as- 
sembly of states in thermal distribution, not, 
however, for the transmission coefficient of a 
single quantum state. The example in Section 3 
should make this more clear. 

B. In classical theory, for low temperatures, 
p; and p; approach zero as it is very improbable 
that a system with barely enough energy to cross 
the activated state will find its way back.‘ For 
higher temperatures, the reflection coefficients 
may increase. This must be expected, in par- 


‘ticular, if the vibrations in the transition state 


are less stiff than in the initial state, i.c., for 
the exceptionally fast reactions. 

We shall consider an energy surface on which 
the energy and vibrational frequency change 
within a very short. distance along the reaction 


‘path. The abruptness of the energy change 


tends to make y small; while the straightness of 
the reaction path tends to make it large. We shall 
suppose that the energy surface at the initial 
state has a smaller curvature perpendicular to 
the reaction path than at the final state, but that 
the minimum energy of the initial state is higher 
than that of the final state. Fig. 2 shows such an 
energy surface. All the systems of low energy 
pass from the initial to the final state, but part 
of the systems of high energy are reflected. 
If the potential energy is Vi=A-+a;x? for the 
initial and V,;=a,x? for the final state, only those 
systems can be reflected which lie outside the 
point of intersection, x’ =A!/(a;—a,)}. This is a 
negligibly small fraction of the systems at low 
temperatures and hence y=1. The reflection 
plays an important role only at temperatures so 
high that kT >a.x". This is equivalent to the 
condition: 


A/RT <a,/a:—-1=(r,fv)?—1, (9) 


where v; and vy are the vibrational frequencies in 
initial and final states. 

The existence-of a limiting temperature, above 
which y becomes small, will always be true 


180 Part I. Physical Chemistry 


620 J. 


Intiel 


Fıc. 2. Potential energy surface illustrating the variation 
of the transmission coefficient with temperature. From 
classical mechanical considerations, only those systems can 
be reflected which have sufficient energy to reach the 
baffle. The coordinate axes refer to the upper figure. 


whenever the frequencies in the transition state 
are lower than in the initial state. The change in 
energy between the initial and the final state 
does not have to take place abruptly nor does 
the reaction path have to be straight. This 
becomes evident if we compare the rate of the 
reverse reaction, according to Eq. (1), with the 
total number of collisions in which the energy of 
the colliding particles is greater than A. Clearly, 
this latter quantity gives an upper limit for the 
rate.’ The number of collisions involving energies 


greater than A is 
(kT/2rm)!(A/kT-+1) exp (—A/RT). (10) 


This must be larger than the reaction rate 
according to Eq. (1): 


k= y(vs/vs)(RT/2xm)' exp (—A/RT). (11) 
Thus y must be less than unity if 
A/kT <v,/v—1. (12) 


The limiting temperature for a transmission 
coefficient of unity as given by Eq. (12) is about 
right if the reaction path is straight and the 
transverse vibration frequencies change slowly 
along the reaction path. The temperature given 
by (12) is much higher than that of Eq. (9). 
This corresponds to the fact that y is larger for 


7K. F. Herzfeld, Zeits. f. Physik 8, 132 (1922); M. 
Polanyi, Zeits. f. Physik 1, 337 (1920). 
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energy surfaces having smooth rather than 
abrupt changes. In actual chemical reactions, 
y will depart from unity somewhere between 
these two temperatures. Thus, in general, the 
transmission coefficient falls below 1 at high tem- 
peratures for classical mechanical reasons. We 
shall see in the next section that at low tempera- 
tures, where y should be 1 according to classical 
mechanics, it is also smaller than 1 for quantum- 
mechanical reasons. 


III. ONE-DIMENSIONAL RATE PROBLEMS 


The simplest examples of reaction rate involve 
motion in only one dimension. If the activated 
state method applies, we can divide the calcula- 
tion of + into the two easier problems of deter- 
mining the reflection coefficients py and p: which 
refer to the passage from the activated state to 
the final and to the initial states, respectively. 
For one-dimensional problems these reflection 
coefficients are, of course, zero in classical 
mechanics. 

In the special case of Eckart’s potential 
functions, 


V(X)=A(itexp (27X/D)-, (13) 


decreases from A to 0, as X increases from large 
negative values (initial state) to large positive 
values (final state). The energy drop takes place 
around X =0 in a distance roughly equal to 1/2 
(Fig. 3). The reflection coefficient corresponding 
to the motion from negative to positive X is: 


exp (2rl(p—-g)/h)— 17? 
=| ——— Arlg/h), 4 
Fr nye le Eea 


where g=(2m(E—A))! and p=(2mE)? are the 
momenta of the system before and after the 
potential drop. As this drop becomes abrupt 
(1—0), the reflection coefficient becomes 


p= (p—g)*/(p+9). (15) 


The explanation of this reflection is similar to 
that for the reflection of light at a glass to air 
interface. In quantum mechanics the dynamical 
system is represented by a wave packet with a 
wave-length which decreases from h/q to h/p as 
it passes across the potential drop. The energy 


change can therefore be expressed as a change in 
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the index of refraction and the analogy with 
light is complete. 

In Fig. 4 we have plotted the reflection coefh- 
cient for an H atom when passing across abrupt, 
steep and gradual Eckart potential drops of 
10 kcal. The gradual drop, /=1.0A, is still some- 
what steeper than those which occur in the 
majority of chemical reactions. We see that the 
reflection coefficient approaches zero rapidly as 
the energy of the system is increased. One will 
expect, therefore, that the quantum corrections 
in one-dimensional problems are important only 
for reactions at very low temperatures involving 
light molecules or for isotope separations where 
slight differences in rates are significant. 

Strictly speaking, this conclusion holds only 
if the barrier is so wide that (2) is satisfied, and 
consequently the method of Section 2 can.be 
used to express the transmission coefficient in 
terms of the reflection coefficients. Otherwise the 
transmission must be computed by quantum- 
mechanical methods. For the sake of concrete- 
ness, let us consider a potential barrier with 
abrupt energy changes (Fig. 5). We denote by 
pi, q and py the momenta of the system in the 
initial, transition, and final states. Substituting 
the reflection coefficient (15) into (8), we would 
obtain for the transmission coefficient of the 
barrier: 


49 =4qpibs/ (P+ dibs) (bi td). (16) 


In this expression there is no dependence of the 
transmission coefficient on the width of the 
barrier, d. 


. ECKART POTENTIAL 


A 


Fic. 3. A special case of the Eckart potential gives a 
function which changes smoothly from one constant value 
to another. 
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j REFLECTION COEFICIENTS 
AT A POTENTIAL DROP OF 10 KCAL. 
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Fıc. 4. Probability of reflection at abrupt, steep and 
gradual potential energy drops as a function of the energy 
of the system. i 


One can compute the exact value of the trans- 
mission coefficient by fitting together the wave 
function and its derivative at the discontinuities 
of the potential energy. One obtains in this way 


Ag’pidr 


zn 17 

(pit ps)? cos? e+ (q?+ pis)? sin? ¢ ee 
where o=2#rgd/h. This is evidently different 
from (16). If the width of the barrier is not large 
compared with the wave-length, quantum me- 
chanics prevents us from even defining a transi- 
tion state. If the barrier is wide enough to 
justify the activated state method, the exact 
transmission coefficient of Eq. (17) shows rapid 
fluctuations with the energy. Averaging y over 
a small energy range (which corresponds under 
these conditions to averaging over ¢) one obtains 
an averaged transmission coefhcient, which turns 
out to be equal to the transmission coefficient 
of Eq. (16): | 


(18) 


It is only in this sense that Eq. (8) is valid. For 
any particular energy, the waves reflected at the 
two discontinuities of the potential interfere with 
each other to aid or to hinder the transmission. 
The effect of this interference averages out, 
however, for reasonably large energy ranges. 
Figure 6 shows y and 7 plotted as functions of 
the energy of the impinging particles. Here 
A,;=A2=A=10 kcal. and d=1.0A. The exact 
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Fic. 5. One-dimensional energy barrier with abrupt changes 
in the potential. 


transmission coefficient, y, oscillates about 7. 
The oscillation of the transmission coefficient 
becomes quicker with increasing mass of the 
particles. It is interesting to note that the 
maximum in the transmission coefficient occurs 
for deuterium before it occurs for hydrogen so 
that at an energy of 10.2 kcal., the transmission 
coefficient for the former is about five times 
greater than for the latter. The situation is 
reversed, however, at 10.5 kcal. 

It seems tempting, at first, tọ try to utilize 
the large ratios of transmission coefficients, as 
shown in Fig. 6, for the separation of. isotopes. 
This is difficult, however, for several reasons. 
First, because the energy of the reacting systems 
is not restricted to one single value in actual ex- 
periments but covers—according to the Maxwell- 
Boltzmann distribution formula—a region of 
appreciable width so that the fluctuations shown 
in Fig. 6 average out to a large extent. Further- 
more, the ratio of transmission coefficients 
fluctuates not only as a function of the energy 
but in many dimensional problems also as a 
function of the other parameters of the problem. 
As these parameters also cover a range of values 
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Fic. 6. Transmission coefficients for H and for D atoms 
passing over a one-dimensional potential energy barrier. 


for the reacting systems, the fluctuations tend to 
average out to an even larger extent. Finally, 
the fluctuations are as pronounced as in Fig. 6 
only if the top of the barrier has a flat portion, 
which is not very much shorter than the wave- 
length. 

On the other hand, one can, in principle, 
always obtain large separations at very low 
temperatures, where ‘‘tunneling’’ becomes im- 
portant. In quantum mechanics, systems with 
less energy than the activation energy may still 
react although only with a small probability.® 
The transmission coefhcient is still given by (17) 
where now gq is imaginary. 

In unit time the number of molecules with an 
energy between E and E-+dE which hit the 
barrier is proportional to exp (—E/kT)dE. The 
ratio of the actual rate, k, to the classical rate, 
Reiass, is then: 


f y(E) exp (-E/RT)dE+ f "y(E) exp (-E/RT)dE 


keissa 


Rthrough Rovér 


Kelans ke loss 


iR exp (-E/kT)dE 


(19) 


8 Similar calculations were made before particularly by R. P. Bell, J. Chem. Phys. 2, 164 (1934); Proc. Roy. Soc. 
A139, 466 (1933); H. Eyring and A. Sherman, J. Chem. Phys. 1, 345 (1933); C. E. H. Banri and G. Ogden, Trans. 


Faraday Soc. 30, 432 (1934). 
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Figure 7 shows the relative number of D atoms 
which react at 126°K as a function of their 
energy. There are two peaks in this curve. One 
corresponds to the reactions, Rthrough, due to 
penetration of the barrier and the other, Rover; 
corresponds to passage over the barrier. At a 
slightly higher temperature the penetration is 
unimportant; at a slightly lower temperature 
the passage over the barrier is unimportant. 
At high temperatures, where classical theory is 
valid, the rate of the crossing of the barrier is 
2! times higher for hydrogen than for deuterium, 
owing to the higher velocity of the former. 
Table I shows the rates of crossing, for H and 
for D, relative to the classical rate of H at 252°. 
Here Rtrans iS the reaction rate which would be 
predicted on the basis of the transition state 
method using the quantum-mechanical value for 
the reflection coefficients, and Eq. (8) 


I) 


f z(E) exp (- E/kT)dE 


trans A 


w 


f exp (-E/kT)dE 
A 


Retase 


At room temperatures, the penetration of this 
barrier is: unimportant; the separation of the 
isotopes is small; and the transition state method 
is satisfactory. At low temperatures the transi- 
tion state method is inapplicable since most 
reactions occur by penetration of the barrier. 
Here the reaction rates are thousands of times 
faster for the lighter isotope. The above table 
is in agreement with the calculations of Bells 
who used an Eckart type of potential barrier. 
From this table we can make the following 
observations: At very low temperatures an 
experimenter would find that a reaction of this 
nature has an anomalously small steric factor 
and shows practically no “activation energy.”’ 
(But at temperatures so low that Rthrough iS 
greater than Rover, the rates are so small that 
they can hardly be measured.) At high temper- 
atures he would obtain an activation energy 
agreeing with his classical expectations but the 
steric factor would still be small by a factor of 
two or three. 
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Fic. 7. Relative number of D atoms which succeed in 
crossing a potential energy barrier 10 kcal. high and 1A 
wide plotted as a function of their energy. The systems 
approaching the barrier have the energy distribution corre- 
sponding to 126°K. 


IV. Tue SIMPLEST MANY-DIMENSIONAL RATE 
PROBLEM—EYRING’S FORMULATION 


Reaction rate problems in many dimensions 
are more complicated than the one-dimensional 
examples principally because of the interchange 
of energy between the various degrees of freedom. 
In this section we consider the simplest of all 
many-dimensional problems—a straight reaction 
path along which the motion is slow and perpen- 
dicular to which the motion is fast. 

If we designate by X the coordinate along the 
reaction path and x the coordinate perpendicular 
to the reaction path (considering only one such 
coordinate x for the sake of simplicity), the 
potential energy of the system is U(X,x). We 


TABLE I. 
KTHROUCH | fover 
T Rcuass | crass k kerass RTRANS 
84° 1.7 108 | 0.15 | 7.1-1075 | 4.3 -10723 3.2.1019 
H | 126°} 7.2-10% 0.22 | 1.5-1074 | 2.1-1079 | 2.9. 10710 
252°| 0.014 0.35 0.364 1.0 0.39 
84° | 1.5-108 0.22 |4.6-10-1% 3.0-10-18] 2.3-10719 
D |126° 0.76 0.26 | 1.5-1079! 1.5-1079 | 2.0-107-1° 
252°} 0.0023 0.37 0.26 0.707 0.28 
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denote by xo the point x at which U(X, x) 
assumes its minimum value for a given X. 
Obviously xo is a function of X. U(X,x), asa 
function of x has approximately the shape of a 
parabola in the neighborhood of x=xo. If the 
motion along the reaction path is so slow that 
the system can vibrate many times in the x 
direction before it proceeds so far in the direction 
of X that the curvature of the parabola is 
changed considerably, the motion perpendicular 
to the reaction path can be considered as simple 
harmonic with the frequency v(X). The po- 
tential energy which is responsible for the motion 
of the slow coordinate X is then 


Vn(X) = U(X, xo) + (n+$)ho(X). (20) 


Here n is the quantum number of the vibration 
along x. If V„(X) is a slowly varying function 
of X, the motion along the reaction path will 
obey the classical equations of motion. 

It is essential in the derivation of Eyring’s 
formulae! that the moticn in the fast and in the 
electronic coordinates be adiabatic in the sense 
that neither the electronic quantum states nor 
n undergo any changes. We can use the ordinary 
methods of perturbation theory to obtain 
the corresponding condition on the velocity, 
v=aX/dt, and on the change in the vibration 
frequency along the reaction path, dv(X)/dX, 


dv/dt=vdu(X)/dX<v*. (21) 


Here v can be much greater than the velocity cor- 
responding to the average thermal energy since 
it must be fast enough to permit the crossing of 
the potential barrier. Thus (21) is more stringent 
than for the case of motion with ordinary 
thermal energies of the order of AT. 

Under the above conditions, the motion of a 
system with the vibrational quantum number n 
will obey classical mechanics and will be governed 
by the potential (20). Therefore, the number of 
systems with this vibrational quantum number 


which react in unit time will be given by (1) 


into which : 
Pin=d exp (— V,(X,)/kT) 


must be substituted for P,, where V,(Xo) is the 
highest value of V,(X), corresponding to the 
activated state. | 
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The total reaction rate is the sum of the rates 


of all systems in all possible vibrational states. 
This is 


Puöy/öP;=Püöy/öp,, (23) 
where 
P=} Pin = ô exp (— Vo(X0)/kT) 
X(1—exp (—hv(Xo)/kT))-!. (24) 


Thus under these assumptions we are led back 
to Eq. (1) except that the quantum theoretical 
probability appears in it, instead of the classical 
one. 

It may be worth while to remark in this 
connection that if the reflection coefficients are 
zero and the molecules are distributed before 
the reaction according to thermal equilibrium, 
the thermal equilibrium will be maintained 
throughout the course of the reaction. Consider 
each vibrational state as forming a separate 
energy surface with respect to the motion along 
the reaction path. On the surface characterized 
by the vibrational quantum number n, the 
density at any point, X, is given by the baro- 
metric formula, C, exp ((V,„(initial) — V,(X))/ 
kT). The condition that the systems be at 
thermal equilibrium initially is that 


C„=C exp ((Vo(initial) — V,(initial)) /RT). 


From this it follows that the ratio of the density 
on the uth to that on the zeroth vibrational 
state is exp ((Vo(.X) — Va(X))/kT) for any value 
of X. 

The consideration of this section shows that 
it is easy to justify Eyring’s formulae with a 
mechanically sensible transmission coefficient if 
the coordinates can be divided into some which 
involve slow motion and others which involve 
fast motion. The slow motion can be described 
by classical mechanics. The fast motion can be 
treated on a par with the motion of the electrons: 
the corresponding quantum numbers undergo no 
changes during the reaction. This condition 
restricts the consideration of this section to 
potential surfaces where the frequencies along 
the reaction path change slowly. 
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Fic. 8. Two-dimensional potential energy trough with 
a oe reaction path and an abrupt change in the poten- 
tial. Here the G’s are the vibrational energy levels in the 
initial state and the F's in the final state. 


V. ABRUPT CHANGE OF POTENTIAL ENERGY ON 
Two-DIMENSIONAL SURFACES 


The difficulty of solving partial differential 
equations restricts the number of many-dimen- 
sional rate problems which we can treat accu- 
rately. In this section we consider examples in 
which the reaction path is straight and the 
change in the potential occurs in a very short 
distance. It would also be desirable to consider 
the effect of curvature in the reaction path. 
The potential energy surfaces are constructed 
so that the Schrédinger equation is separable in 
each of a number of regions. In each region, the 
wave functions are obtained as solutions of 
ordinary(one-dimensional) differential equations. 
The wave functions are then pieced together so 
that the functions and their derivatives are 


continuous at the boundaries of every two 


regions. This is carried out by matrix methods 
and the complete mathematical treatment is 
given in the appendix. 

Let us consider an energy surface similar to 
that shown in Fig. 8. For negative X, the 
potential energy is Vi(X) =A +a,x*; for positive 
X, the potential energy is V;s(X)=a,x*?. This 
surface is the two-dimensional analog of the 
potential drop of Section 3. Eq. (8) enables us 
to obtain the average transmission coefficient 
from the reflection coefficients at two energy 
drops similar to that of Fig. 8. The case in which 
the energy of the system is large compared with 
the vibrational energy of any of the levels in 
the final state to which the waves are likely to 
be transmitted is particularly easy to treat. 

The vibrational quantum number of the 
reflected systems can differ, to the approximation 
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used in the appendix, only by 0 or +2 from the 
vibrational quantum number of the incident 
systems. Eqs. (36) and (40) of the Appendix 
show that even the change by =2 is quite im- 
probable as e is always a rather small quantity. 
The transmitted waves can have a vibrational 
quantum number differing from that in the initial 
state by an even number. In Fig. 9, we have 
plotted the probabilities of reflection and trans- 
mission into states with different vibrational 
quantum numbers. One sees by comparison with 
Fig. 4, that the reflection coefficient is rather 
similar to that of a system with equal transla- 
tional energy in a one-dimensional problem. 
One sees, furthermore, that the vibrational quan- 
tum number has a marked persistence even in 
this case of an abrupt potential change. This 
means that the vibrational energy increases if 
the frequency is larger after the drop, and de- 
creases in the opposite case. The changes in the 
vibrational frequency, inasmuch as they occur, 
have a tendency to counteract this. Thus the 
vibrational quantum number is more likely to 
decrease if the vibration is stiffer after the drop 
and is more likely to increase in the opposite 
case. A more detailed discussion is given at the 
end of the Appendix. 

The persistence of the vibrational quantum 
number is doubtless in close connection with the 
slowness with which the equilibrium distribution 
of vibrations is established. At higher vibrational 
quantum numbers changes in the vibrational 
quantum number become more probable which 
is also in qualitative agreement with experiment.’ 

For the study of reaction rates, the reflec- 
tion coefficients are most important. It is seen 
that in the case investigated here—large kinetic 
energy in the final state—these reflection coeff- 
cients are not increased over the value they 
assume in a similar one-dimensional problem. 
This leads to the conclusion that the transmission 
coefficient is even in this case not far from unity 
and Eyring’s well-known formulae remain ap- 
plicable. It would be desirable to obtain a more 
general solution of the problem considered in 
this section than that given in the Appendix and 
we intend to return to this question at a later 
time. 


° Cf., F. Patat, Zeits. f. Elektrochem. 42, 265 (1936). 
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PROBABILITIES OF REFLECTION AND TRANSMISSION 
2 AT ABRUPT DROP OF 9 KCAL. IN POTENTIAL ENERGY 
WHEN VIBRATIONAL FREQUENCY CHANGES SIMULTANE- 
= OUSLY FROM hy= 4 Kcal TO hy = 2kcal 
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Fıc. 9. Probabilities of reflection and of transmission at an abrupt potential 
drop of 9 kcal. when the vibrational frequency changes simultaneously from 
hv,=4 kcal. to hyy=2 kcal. The lower curves are for systems initially in the zeroth 
vibrational state; the upper curves are for systems initially in the 2nd vibrational 


State. 


APPENDIX 


Calculation of transmission and reflection coeffi- 
cients for many dimensional problems 


Let us denote by X the coordinate along the 
reaction path and by x the coordinate (or 
coordinates) perpendicular to it. For X <0 the 


potential energy is V.(x); for X>0 it is Vs(x). 


The characteristic functions and characteristic 
values of the Schrödinger equation Hy = Ip with 
the potential V;.(x) shall be ¥.(x) and J,, those 
of the Schrödinger equation H;y= Fg with the 
potential V,(x) will be (x) and F+. Greek 
letters as subscripts refer to the initial state, 
roman letters to the final state. Expanding the ¢» 
in terms of the y we obtain 


pala) = Eura) (26) 


and z is a unitary matrix 


(ut) sk = Ukr”. 


(27) 


We shall consider an incident wave from the 
left having the form y.(x) exp (2irg.X/h). It 
will give rise to a reflected beam in which the 
atoms have all possible quantum numbers X for 
their motion in the x direction. Thus the wave 
function for X <0 becomes 


¥.(x) exp (27ig,X /h) 
+ LRava(x)exp (—-2rigX/h) (28) 


with 


q2/2m+I,=E, (28a) 


where E is the total energy of the system. For 
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X>0 we have only an outgoing wave 
LT xl) exp (2rip,X /h) 
k 
= LT kU erWr(x) exp (2ripıÄ/h), (29) 
BD 


where again 
(29a) 


Naturally, the 9, and q with high quantum 
numbers x and k will be imaginary. The corre- 
sponding terms represent local disturbances in 
the neighborhood of the discontinuity. The 
imaginary p and g, must be, if divided by z, 
positive. Terms with imaginary momenta of the 
opposite sign of p or g would correspond to 
waves which have progressively greater ampli- 
tudes as we depart from the discontinuity. The 
total number of systems with quantum number 
A reflected in unit time by the discontinuity is 
R= = | Rerl?qrn/m, if gy is real (there is of course 
no reflected wave with imaginary gı). Similarly, 
the total number of systems transmitted in 
unit time with the quantum number k into the 


p2/2m+ F=. 


final states is T,= | Txe |?Pr/m if px is real, zero _ 


if p is imaginary. There are q,/m systems 
incident in unit time.!? 

Both the wave function and its derivative 
must be continuous at X=0. Comparing the 
coefficients of Yı(z) of Eqs. (28) and (29) and 
of their derivatives this gives: 


bat Ra= LT art, (30) 
k 

ôg — Raga = IT rtkpk. (30a) 
k 


The formal solution of these equations can be 
obtained by matrix algebra. We can write for 
(30) and (30a) 

1+R=Tu, 


qg—Rq=Tpu. 


(31) 
(31a) 
Here 1 is the unit matrix, p and g diagonal 
matrices while R, T and u are general matrices. 
We have, from (31) and (31a), T=(1+R)u7 
=(1—R)gu-'p- and hence, with = qgu”!p"!u, 


R=(¢—1)(¢+1);  T=2¢(¢+ 1). (32) 


10 The matrix method for problems of atomic scatterin 
te ag initiated by J. A. Wheeler, Phys. Rev. 52, 110 
1937). 
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We can use (32) to verify that the total number 
of incident systems is equal to the number of 
reflected and transmitted systems. The reflection 
and transmission matrices R and T for the 
problem in which the potential is Vy in the 
initial and V; in the final state, can be calculated 
from Rand T 


R= —uRu; T=u(i—R). 


Naturally, (32) represents only a formal 
solution of our problem, and the matrix elements 
of R and T still remain to be calculated. We 
shall obtain them for the special case in which 
the total energy E is very large compared to 
the energies F, of the levels to which there is an 
appreciable transition. This means that in the 
final state the translational energy pi7/2m> Fk 
will be large compared with the vibrational 
energy, although this need not be true for the 
initial state. 

In this case F&E and" we can write, denoting 
by f the diagonal matrix with the diagonal 
elements Fi, 


(32a) 


paquip ty 


=qu(2mE)i1+f/2E)u=to+tti, (34) 
where we have with P= (2mE)? 
f=Pg; f= ePCEgu-ifu (34a) 


and {, is small compared with fo. Now we can 
write, up to terms of second order in {ı 


R= (So-14+$1)(Sot+14+ 8)" 
= (Fo-1 4 $1) (Sot NLA ++ HIHI 
= (So— 1451) (Sot HILL ttot 1) 
= (So-1) (Sot L A+ 2(So+1) 7 i(Fo+1). (35) 


Hence 
q«— P PE QF 


Ra = — tt 
q+P (4.+P)(a+P) 


(36) 
where 


Fa= Yun Fete = f YAH rh 
k 


For the amplitude of the wave reflected without 
any change in the vibrational quantum number, 


11 In this calculation, the zero of the energy is at the 
bottom of the potential curve V(x). E and P are numbers, 
the other symbols matrices. 
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the expression 
Ra= («- Px’) (act p.) t; Di /Am+ Fa =E (37) 


is correct up to second-order terms in F. This 
shows a marked analogy to (15) with the 
difference, however, that p,’ enters instead of p. 
The former is smaller than the latter, at least 
for k=1, as Fıı>Fı. Hence, the reflection 
without change of the vibrational quantum 
number is smaller in the two-dimensional case 
than the whole reflection in the one-dimensional 
case. In order to calculate the F,, we can note 
that 


Fa= [vem= [pH fvt Var 


S143. Vs (38) 


If V; and V; have both the potential of a simple 
harmonic oscillator, 


V;=A+27’mv?x?; (39) 


(36) and (38) show that the vibrational quantum 
number can change only by 0 or #2 in our 
approximation. Then 


(Vi— V)a=A+tchviek+}), 
(Vi— Vera (Vi— Viet 
= Fehv;(x+1)8(x«+2)! 


with e=(v2—v,/)/»2. There is no change what- 
ever in the vibrational quantum number if the 
frequencies in initial and final states are equal. 
In general, the probability of a change increases 
with increasing vibrational energy in the initial 
state, and also with increasing translational 
energy as long as the latter remains small 
compared with the total drop A in the potential.” 


Vj= 2mm Px, 


(40) 


12 It should be remembered that the probability of re- 
flection is | Ren l°9a/g«, not Re- 
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For the transmission matrix we obtain from 
(31) 


Ta= > (ôa tH Raun" 
2q. PET!\«Faun* 


= E a a 
= Gt PtP) ae 


The second term contains the factor F/E and 
is much smaller than the first. We shall evaluate 
(41) also for the potentials (39). In this case Ty: 
will be zero unless the change (k—x«) of the 
vibrational quantum number is even. This is 
because both F,, and k: vanish, unless the 
difference of their two indices is even. The first 
few u are 


too = (1—e)*(1—e+ 1e), 
Ilo = — U20 = 2—3tto0e(2 —e)7}, 


ttg2 = Uoo(1 —e— ge”) (1 —e-f te). 


(42) 


If the change in vibrational frequency is not too 
large, e will be small. Under these conditions, 
at least for low values of the indices, the di- 
agonal elements of x are nearly 1. The off- 
diagonal elements are much smaller, as 2x, 
contains the $(k—x«)'th power of «e. Thus the 
probability for a change in « is small. For large 
values of the quantum numbers, the vibrational 
energy in the final state covers a wider range. 
In the classical region, it extends from the 
vibrational energy in the initial state I, to 
I,(1—e). Its mean value is still very nearly the 
same as if there was no change in vibrational 
quantum number: it is J,(1—4e) instead of 
I(l- e}. 

The authors would like to express their 
appreciation to the Wisconsin Alumni Research 
Foundation for financial support throughout the 
course of this work. 
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Effects connected with a more complicated nature of the energy surfaces of relatively simple 
reactions are discussed. The theory of elementary association reactions is examined and it is 
shown that better results are obtained in the rate of association of atoms when three additional 
attraction states are taken into consideration. The rates of association of iodine, bromine and 
chlorine atoms are calculated and compared with the experimental rates. The improvement to 
be expected from consideration of the angular momentum of the associating pair of atoms is 


discussed. 


VERY modern theory of reaction rates has 

to deal with two essentially different kinds 

of: problems. The first problem concerns the 

forces acting between the atoms participating in 

the reaction, the second one is a statistical ques- 

tion and aims at a calculation of the rate con- 

stant, assuming that the answer to the first 

question is known. We shall discuss first the 

reactions with, then those without, activation 
energy from this point of view. 


I 


The solution of the statistical problem is 
probably known at present,! for gas reactions 
with an activation energy, to such an extent that 
not very much further progress can be expected 
in this connection in the near future. To be sure, 
several points of the theory still remain to be 
cleared up,? particularly when quantum effects 
play a major role and also very few examples of 
nonadiabatic reactions have been considered so 
far in detail. However, the remaining questions 
are of a more special nature and appear to be so 
dificult that we must feel satisfied, for the time 
being, with a partial solution. 

On the other hand, our knowledge of the 
forces between atoms is far from being complete 
enough to allow the accurate calculation of reac- 
tion rates with activation energies. Considering 
the difficulties involved in explaining the binding 
energies of well-known ordinary compounds, one 
will be hardly surprised to find that—apart 


* Presented at the symposium on “Kinetics of Homo- . 


geneous Gas Reactions.” See page 633. 

1 Cf. various papers in the Trans. Faraday Soc. (1937) in 
the general discussion on reaction kinetics. 

2 Cf. also li O. Hirschfelder and E. Wigner, J. Chem. 
Phys. (in publication). 


from one single instance?—no accurate calcula- 
tion of activation energies exists. It is more un- 
satisfactory that even the general character of 
the forces between the atoms of a reaction are 
known only in the very simplest cases. Although 
one could numerically predict rates of reactions 
with activation energy only if one were able to 
calculate the activation energy accurately, the 
knowledge of the general properties of the forces 
between atoms would be valuable in any case by 
enabling us to compare rates of different reac- 
tions on a more satisfactory basis. 

The general character of the forces between 
atoms, i.e., the “energy surface,” is well known 
only for a system of three atoms, at most, all of 
which are in S states. There are several reasons 
to believe that the energy surfaces are much 
more complicated in other cases than these 
known energy surfaces are. Firstly, the number of 
ways atoms in P and higher states can be orien- 
tated with respect to each other is much greater 
than for atoms in S states. Second, it appears at 
present most probable that the reason for the 
discrepäncies between experimental and the- 
oretical reaction rates is due to the unexpectedly 
complicated. shape of these potential surfaces. 
Strong indications in the same direction are 
given also by preliminary calculations of Magee‘ 
who considered a system formed by a P and two 
S atoms. Finally Knipp® has shown that com- 
plications can arise even in the case of two atoms 
inasmuch as states which are known not to be 
attractive at short distances become weakly 


3 ek Eyring and Topley, J. Chem. Phys. 4, 170 
936 


1 : 
1J. L. Magee, unpublished. 
5 J. Knipp, Phys. Rev. 53, 734 (1938). 
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attractive for larger separations, while states 
which are attractive at short distances are in 
many instances repulsive at large distances. 
Thus, there is no energy surface, as in the case 
of only S atoms, which is lower than all other 
energy surfaces for all configurations. 
Possibly, the explanation of a very puzzling 
phenomenon, observed by Polanyi® and col- 
laborators, and first emphasized in this con- 
nection by Goodeve,’ lies also in a similar direc- 
tion. The phenomenon is the excitation of a Na 
atom, with a probability approaching 1, by an 
excited NaCl* molecule obtained by the reaction 
Na,+Cl=Na+NaCl*. Since the whole system 
is, originally, on the lowest energy surface, it 
should remain there, according to the theory of 
the adiabatic nature of these reactions.* How- 
ever, a system containing an excited Na atom is 
clearly not on the lowest energy surface. Accord- 
ing to the customary ideas, the NaCl* molecule 
has only vibrational and rotational but no elec- 
tronic excitation. The two ions in the molecule 
vibrate against each other very strongly—in the 
rather improbable case that the NaCl* contains 
all the reaction energy of 3.35 electron volts, the 
maximum distance of the Na and Cl ions be- 
comes about 6.5A. This is, nevertheless, much 
less than the distance at which the molecular 
curve, corresponding to the ordinary heteropolar 
molecule, crosses? the molecular curve corre- 
sponding to the homopolar NaCl. This latter 
distance is more than 10A. Thus, for the energies 
available, the lowest curve is well separated from 
even the next curve. In the presence of an addi- 
‘tional Na atom, i.e., during the collision of the 
NaCl* with Na, both molecular curves will be 
deformed and an overcrossing cannot be alto- 
gether excluded. However, even if such an over- 
crossing should occur, it hardly would lead with 
an appreciable probability to transitions to such 
highly excited energy surfaces which correspond 
to an excited Na atom. 
It appears, on the whole, more probable that 
the original reaction Nag+Cl= Na+ NaCl* leads 
¢H. Beutler nd M. Polanyi, Zeits. f. Physik 47, 379 
(1928); Zeits. hysik. Chemie 1B, 3 eva St. v. 
Bogdandy and M. Polanyi, ibid. 1B, 21 (1928); M . Polanyi 
and G. Schay, ibid. 1B, 30 (1938). 
1C. F. Goodeve, Trans. Faraday Soc. 30, 60 (1934). 
8 F. London, Sommerfeld Festschrift (Leipzig, 1928), p 


104. 
? F. London, Zeits. f. Physik 74, 143 (1932). 
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to the homopolar state of the NaCl molecule 
rather than the normal state. The distance at 
which the Na¿+CIi curve crosses the Nas + CI- 
curve is greater than 6.7A so that there remains 
a large probability for a nonadiabatic behavior, 
i.e., for the transition to the higher, homopolar 
level. If this is true, the energy in NaCl is stored 
up to a great extent as electronic energy and it is 
well known that this can serve more readily than 
vibrational energy can, to the excitation of the 
electron in another atom. The weakest point of 
this explanation consists in the apparent absence 
of a reasonably deep minimum in the molecular 
curve of NaCl in the homopolar state. However, 
the evidence is not conclusive on this point. 

This example should only emphasize the fact 
that complications can exist even in the energy 
surfaces of very simple systems and that these 
can lead to rather unexpected results. On the 
whole, the status of the theory of reaction rates 
with activation energy is that the statistical part 
of the theory is well developed, but our knowledge 
of the forces between atoms is adequate only in 
a few cases. 


II 


The status of the theory of reactions without 
an activation energy is rather the opposite. The 
statistical part of the theory offers considerable 
difficulties, but the details of the behavior of the 
forces between atoms enter in a much less critical 
way than for reactions with activation energy. 
Nevertheless, the example to follow shows that 
they may form, even in this case, the most im- 
portant link of missing information. 

Perhaps the most simple among the reactions 
without activation energy are those leading to a 
recombination of atoms by collision with a third 
atom and I attempted, some time ago, to derive 
a formula for the rate of these reactions” and 
applied it to the rate of the reaction I1+1+X 
=],+X which has been measured by Rabino- 
witch and Wood." 

The general idea of the calculation is illus- 
trated in the schematic Fig. 1. In this, r is the 
distance of the two I atoms, p, the corresponding 
momentum, x a coordinate of the third body X, 

10 E, Wigner, J. Chem. Phys. 5, 720 (1937). 


u E, Rabinowitch and W. C. Wood, J. Chem. Phys. 4, 
497 (1936). 
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measured from the center of the two I atoms. 
The other 15 coordinates of the phase space are 


suppressed in the figure, for sake of simplicity. 


The rate is defined as the number of pairs of I 
atoms passing in unit time from the dissociated 
into the associated state. For large values of x, 
positive or negative, there is no doubt which 
part of the figure corresponds to associated and 
which to dissociated atoms: the former is the 
region in which the relative energy V(r) 
+3p,°/m, of the two I atoms is negative, the 
latter in which it is positive. (V(r) is the poten- 
tial energy of the I atoms with respect to each 
other, m, is their relative mass.) The region cor- 
responding to the associated state lies within the 
surface shown on the figure. The paths of the 
different systems are lines in the phase space and 
the reaction rate is the number of such lines 
crossing the surface and entering the associated 
region. At large values of x, the paths in the 
neighborhood of the surface are parallel to it 
and do not cross it since the relative energy of the 
two I atoms is a constant of the motion if the X 
is far away. This is only another expression of the 
fact that the association can only occur in the 
neighborhood of the third body X, i.e., for 
small values of x. 

There will be some paths which enter the 
associated region but leave it immediately after- 
wards—these, of course, should not be counted 
as associations. One will, therefore, draw the 
surface at small values of x in such a way that 
the number of paths crossing it shall be as small 
as possible. This condition, while not changing 
the shape of the surface at large values of x 
(where no crossings occur, anyway), will change 


it at small values of x. It will be such that no 
paths will cross it in one direction just to cross 
it again soon afterwards in the opposite direction ; 
all paths crossing it will stay inside while x 
increases to large positive or negative values. 
The number of paths crossing this surface will 
give the real association rate, while the number of 
paths crossing any other surface, e.g., the surface 
shown on the figure, will be greater than the real 
association rate. Of course, the actual calculation 
must be done for the 18-dimensional phase space 
—not for the schematic space of the figure. 

It is clear then, that the calculated rate should 
be greater than the observed one. Nevertheless, 
Table I taken from reference 10 and completed 
with the results of a similar calculation on Br 
atoms shows that this is not the case. The values 
are given in units of 107% (molecule/cc)”? sec.—!, 
the different columns refer to the different ‘‘third 
bodies” X used for the reaction. The computa- 
tion has been carried out exactly as described in 
reference 10. For the radius of Br, that of Kr, 
1.59A was assumed.” The rate of measurements 
for Br are also due to Rabinowitch and Wood." 


TABLE I, 
x He A H: N: O: 
Br exp. 0.76 1.3 2.2 2.5 3.2 
theor. 1.6 2.1 1.7 2.3 2.2 
I exp. 1.8 3.8 4.0 6.6 10.5 
theor. 1.5 2.1 1.8 2.3 2.2 
CI theor. 1.9 2.5 2.2 2.8 2.7 


12 Cf. Landolt-Börnstein’s Tables, Vol. I, p. 69. 
13 E, Rabinowitch and W. C. Wood, Trans. Faraday. Soc. 
32, 907 (1936). 
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It should be remarked, however, that the earlier 
work of Hilferding and Steiner!?® gave substan- 
tially lower results. 

Of course, the extremely large discrepancy for 
H, and N; is not alarming as it is not permissible 
to treat these as atoms. But a discrepancy 
obtains for the I association in the case of He 
and A also, and every improvement of the cal- 
culation makes this necessarily only worse. In 
fact, an improvement of the calculation, to be 
discussed later, decreases the calculated values 
approximately by a factor 2. 

Several possible reasons for this discrepancy 
have been discussed, among others the possible 
existence of other attractive states of the two I 
atoms, in addition to the one leading to the 
normal Iz molecule. It now seems that this is the 
cause of the discrepancy. 

One should remember that the system formed 
by two I atoms and a He atom can be in 16 dif- 
ferent quantum states, corresponding to the four 
possible orientations of the angular momenta of 
each I atom with respect to each other and to the 
internuclear axis. Only one-sixteenth of all pairs 
of I atoms will be in any specified one of these 
quantum states and ıf only one of these states 
were attractive, the association could occur only 
for one-sixteenth of all pairs of I atoms. In this 
case—and this was assumed at the calculation of 
Table I—the association rate would be 16 times 
smaller than it would be for 16 equally attractive 
states. 

E. Rabinowitch!* has considered the inter- 
pretation of his and Wood’s experiments in a 
very interesting fashion from the point of view 
of the simple collision theory. He pointed out 
that there are indications!® that among the 
molecular curves obtained by approaching two I 
atoms in their normal states, there are at least 
four which are attractive. One of these is simple 
and corresponds to the normal state of Is, the 
other three are doubly degenerate each but, of 

1% Zeits. f. physik. Chemie B30, 399 (1935). 

“ E. Rabinowitch, Trans. Faraday Soc. 33, 283 (1937). 

Cf. R. S. Mulliken, Phys. Rev. 46, 549 (1934); W. G. 
Brown, ibid. 42, 355 (1932); H. Cordes, Zeits. f. Physik 97, 
603 (1935). Note added in proof —Professor Mulliken has 
kindly informed me that he does not consider the evidence 
leading to the second lowest curve (4JI,,) of Cordes to be 
satisfactory. In fact, theoretical considerations lead him 
to expect this state to be repulsive. In this case, the 


theoretical values of Table II should be reduced by about 
30 percent. 
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TABLE II. 
He A He: N: 
I exp. 1.8 3.8 4.0 6.6 
theor. 6.1 7.7 6.4 8.1 
Br exp. 0.76 1.3 2.2 2.5 
theor. 4.25 5.4 4.5 5.9 


course, less strongly attractive than the first one, 
Rabinowitch concludes that no “statistical 
factor,” corresponding to our ṣẹ should be taken 
into account for the consideration of the associ- 
ation rate. This is, perhaps, somewhat exag- 
gerated, as there are certainly several repulsive 
states of two I atoms and these hardly can be 
expected to lead to a recombination. On the 
other hand, the rate of recombination into the 
three less strongly attractive states should also 
be calculated and added to the rate of recom- 
bination into the normal state of Iz. The values 
of the integrals over the three additional molec- 
ular curves of I, have been obtained by a graph- 
ical integration of Cordes’ estimated curves. 
It gave 3510-3 erg cm? for the integral 
—SfV(r)rdr and 129X10-44 erg cm‘ for the 
integral of — / V (r)r?dr for the three less strongly 
attractive states together. These numbers should 
be multiplied by two on account of the statistical 
weight of these states. The corresponding in- 
tegrals for the normal state of I; are 33.5 10756 
erg cm? and 114X10-* erg cm‘ if taken from 
Cordes’ curve and 271073® and 91X107, 
respectively, if computed by Morse’s formula, as 
outlined in reference 10. A similar calculation has 
been done for Br also, although on an even less 
secure basis. The rates calculated for the number 
of transitions into all four states are shown in 
Table II in the same units as in Table I. 
Considering the crudeness of the calculation 
and the fact that the rate could be calculated in 
terms of constants obtained from entirely dif- 
ferent sources, the agreement is as good as can 
be expected. The relation between calculated and 
observed values is as it should be; the latter ones 
are several times smaller than the former ones. 
Even a larger ratio between calculated and ob- 
served values would not be disturbing as the 
former ones must be, by the very nature of their 
derivation, too large and every improvement of 
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the calculation—e.g., the one to be outlined 
below—will bring the two values nearer to each 
other. 

The apparently better agreement for H: and 
N; has no significance as the formulas apply only 
for atoms. It is somewhat surprising that the 
ratio of the calculated rates for Br and I is so 
much smaller than the experimental ratio. 

An improvement of the calculation can be ob- 
tained on the basis of the following consideration : 
A molecule becomes stabilized even if its energy 
is greater than zero if it contains mainly rota- 
tional and only to a smaller extent vibrational 
energy.! Denoting the angular momentum by a, 
a diatomic molecule is (apart from quantum- 
mechanical effects) stable if its total energy is 
smaller than u?/2m,r,? and the separation of its 
atoms smaller than r, where m, is the relative 
mass of the atoms in the molecule and r, the 
distance at which V(r)+ y?/2m,r? assumes its 
maximum value. In this way, a molecule can 
accumulate substantially greater energy amounts 
than sufficient for its dissociation. This fact gives 
us the possibility to place the boundary dividing 
the free state from the associated state not at the 
place where the relative energy of the pair is zero, 
but at the place where, at a separation r<r,, the 
relative energy 4$71,v,2+ V(r) becomes smaller 

16 Cf. G. Herzberg, Erg. exact. Naturw. 10, 207 (1931). 
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than p?/2mr,?. A priori, both procedures are jus- 
tified. The more useful procedure will be, how- 
ever, that one which gives the smaller upper 
limit for the rate. Naturally, even the rate ob- 

tained with the new calculation will be only an 

upper limit for the actual rate, as the relative 
energy can rise above p?/2mr,? immediately after 
it falls below this value. The reason that the old 

calculation gave higher rates than the new one 
is that it considered the number of pairs of atoms 

with a positive total energy to have the equi- 

librium value. It has, however, a lower value in 

reality, since the above-described metastable 

states with positive energy are more quickly 

impoverished by collisions of the second kind 

than they are replenished by the processes which 

we are considering now. The old calculation 
counts as associations these collisions of the 

second kind also (which is correct) and assumes 

that the metastable molecules are present in the 

amount which is in equilibrium with the free 

atoms. It is the avoiding of this last assumption 

which constitutes the improvement of the calcu- 

lation as outlined above. 

It is clear, however, that the computation 
becomes much more involved and its details will 
not be given here. The calculated rates are 
reduced by almost a factor 2 and the agreement 
of Table II much improved. 


Discussion 


E. P. Wigner, Princeton University: I wish to 
make some remarks concerning the adiabatic 
hypothesis. The adiabatic hypothesis states that 
the atoms can be considered to move under the 
influence of a potential. This means, physically, 
that the electronic motion adjusts itself quickly 
enough to the changing positions of the nuclei so 
that no change in the electronic excitation occurs. 
Clearly, if the electrons become excited, the cor- 
responding excitation energy is unavailable for 
the motion of the nuclei, and these do not seem 
to move any more as under the influence of a 
steady potential. 

The conditions for the validity of the adiabatic 
hypothesis have been investigated repeatedly 
from the theoretical standpoint. I wish to point 
to some information, obtained on the basis of 
experimental evidence, concerning the validity 
of this hypothesis. 


The infraction of the adiabatic hypothesis can 
be visualized in the following way. Given two 
atoms in their normal states, there will be a 
potential between them. If one or both atoms are 
excited, the forces will be different and we have, 
therefore, several potential curves (or “energy 
surfaces” in case of more than two atoms). If we 
start out with a certain energy and the two atoms 
in their normal states, the excess energy is present 
as the kinetic energy of the nuclei. These will 
approach each other with increasing velocity at 
first, as long as they are in the attractivc region 
of the potential; then they will approach each 
other with decreasing velocity, come to amomen- 
tary standstill and separate again, finally at- 
taining the same velocity with which they col- 
lided. They will both remain unexcited. This se- 
quence of events will occur if the process is 
adiabatic. Assuming nonadiabatic behavior, 
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there will be a certain probability that one of the 
atoms is excited and the velocity of separation is 
smaller by an amount equal to the excitation 
energy. The nonadiabatic behavior is, therefore, a 
jump from one energy level to another. 

Knowing these energy surfaces, it is not dif- 
cult to give the approximate condition for 
adiabatic behavior. During the period h/E of the 
electronic motion the change in the potential V 
must be small as compared with the energy 
necessary to excite an electron: (h/E)vd V/dx <E. 
Landau and Zener developed an expression for 
the transition probability from one level to the 
other in the case of certain simple shapes of the 
potentials and this expression will play an im- 
portant part in the paper of Noyes and Hen- 
riques. It shows that the transition probability is 
a very sharp function of the quantities involved. 
It can increase by an extremely large factor for a 
comparatively small increase in.the parameters. 

The first set of relevant experiments which may 
be brought up in this connection concern the 
excitation of atoms by impact of other atoms. In 
this case E is relatively large; and we must expect 
adiabatic behavior. Although the experimental 
material is unfortunately rather scarce, it is 
evident that, in agreement with the expectation, 
the excitation by atomic impact has a very low 
yield. Beeck, Dopel, Gailer, and Holzer have 
performed such excitation experiments and the 
excitation seems to be quite unobservable up to 
velocities of about 150 volts where it sets in, ac- 
cording to Holzer, with a very small cross section. 

We know much more about the reverse process, 
i.e., the so-called collisions of the second kind, 
than we do about excitation by atomic impact. 
This process may be readily observed as the 
phenomenon of the quenching of fluorescence by 
foreign gases. In this case the system is first on 
the upper curve and goes over to the lower curve 
(when energy is plotted against internuclear 
distances). 

In the most simple cases the cross section is 
very low for this process, about 1/30 of the gas 
kinetic cross section. Both experiments on Hg 
and on Na illustrate this, if the luminescence is 
quenched with noble gases. This means, of 
course, that the cross section for the reverse 
process is extremely small according to well- 
known considerations using the principle of de- 
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tailed balance. It is smaller in the ratio of the 
kinetic energies before and after the collision, 
which is about 100 to 1. Thus the calculated 
cross section for the excitation process is only 
1/3000 of the gas kinetic cross section and the 
adiabatic hypothesis is well obeyed. However, 
the cross section for excitation increases strongly 
with increasing excess energy of the impinging 
atom. 

The quenching cross section for molecules is 
already very much greater than for atoms of 
noble gases. The smallest one is that of Nz against 
Hg and is 4 of the gas kinetic cross section. The 
other known cross sections are all gas kinetic or 
larger. Two circumstances must be remembered, 
however; first, that the cross section for the re- 
verse process, in which we are mainly interested, 
is about 100 times smaller—increasing, though 
slowly, with energy. Second, that the observed 
cross section is really the sum of very many cross 
sections since the electronic excitation energy 
originally present can be distributed in very 
many more ways among vibrational, rotational, 
etc. degrees of freedom in the final state than it 
could be distributed in the initia] state. Thus the 
adiabatic hypothesis is reasonably well obeyed in 
these cases also. 

The situation becomes radically different if we 
consider not transitions from the lowest to a 
higher surface, but transitions between a pair of 
closely spaced surfaces. The cross sections are in 
this case often greater than the gas-kinetic, 
cross section by a very large factor and Beutler, 
among others, has shown that they increase very 
strongly with decreasing distance of the levels. 
This is, of course, in qualitative agreement with 
our formula, but it says at the same time that 
under these conditions we are very far from a 
fulfillment of the adiabatic hypothesis. It is 
customary in this connection to point out that 
CO and O; which strongly quench Hg radiation 
have probably electronic excitation levels at 4.75 
and 4.68 volts, while the Hg excitation energy is 
4.86 volts. 

I think that these considerations show that the 
adiabatic hypothesis can be expected to hold in 
many cases—not, however, generally. In general, 
the more complicated a system is, the more 
electronic excitation levels will be present and the 
chances for a resonance will be correspondingly 
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increased. This can either hinder the reaction—a 
case which has been much discussed—or it can 
make reactions possible which would not be pos- 
sible otherwise. All cases of chemiluminescence 
belong to this latter class since, in these cases, 
there is an electronic excitation in the reaction 
products while there is none in the initial sub- 
stances. One group of such reactions already con- 
sidered in more detail—the chemiluminescence 
of Na induced by the reaction between Na and 
a halogen—has been carefully investigated by 
Polanyi and co-workers, especially Schay, and 
discussed recently very thoroughly from another 
point of view by Evans and Polanyi.” 


James Franck, University of Chicago: I do not 
understand why one should stress the difference 
between an adiabatic and nonadiabatic process. 
In chemistry the important thing is the relative 
positions of the nuclei with respect to each other. 

In the impacts between two particles, 4 transi- 
tion can occur when the potential curves 
(energy plotted against internuclear distance) 
come near to each other. A mixture of helium and 
mercury at high temperatures gives an emission 
of light, but at low temperatures helium does not 
quench excited mercury atoms. If the nuclei are 
close enough together, the curves will cross and 
electrons can move to a new State. 


John L. Magee, University of Wisconsin: In 
connection with the reaction 


Na+Cl—NaCl+Na* 


which Professor Wigner has discussed I should 
like to point out that the luminescence is to be 
expected even for an adiabatic process. This is 
seen when the splitting of the energy surfaces due 
to effect of the p state of the chlorine atom is 
considered. The reaction occurs in a plane since it 
involves only three atoms, and so the electronic 
states of the reaction system are classified as 
even or odd with respect to this plane. This sym- 
metry is preserved throughout the reaction. The 


11 M. G. Evans and M. Polanyi, Trans. Faraday Soc. 35, 
178 (1939). 
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resulting NaCl molecule is in a E state which is 
even. Thus for reactions which occur in an odd 
state the sodium atom which is formed must be 
excited. The even state is doubly degenerate, and 
thus for adiabatic behavior one of these states 
will also lead to excitation of the sodium atom. 
Professor Wigner points out that the surface for 
the system Na,+—Cl- crosses the surface for 
Naz—Cl at a large distance and the other even 
state which would adiabatically lead to a normal 
sodium atom is probably nonadiabatic due to 
this crossing. 

The relative importance of these three surfaces 
for the reaction will depend upon the relative acti- 
vation energies. At any rate the luminescence is 
not very surprising, since two of the possible 
electronic states must lead to an excited sodium 
atom adiabatically and the third most likely 
yields the same products. 


Gregory Breit, University of Wisconsin: The 
motion of the nuclei is just as important as the 
shape of ‚the energy surface in determining 
whether or not the reactions are adiabatic. 


E. P. Wigner, Princeton University: 1 think 
that the point of view which I presented is very 
similar to Professor Franck’s. Indeed, the pur- 
pose of the approximate formula which I men- 
tioned is to provide a more quantitative back- 
ground to the phenomena which were just 
brought up. There are only two minor additions 
which I would like to make to his remark: First, I 
believe that it is very infrequent that the lowest 
energy level is crossed or even very strongly ap- 
proached by another one and that such a crossing 
or very strong approach, whenever it occurs, is 
due to special conditions such as different sym- 
metry properties of the crossing levels or radic- 
ally different electronic arrangements. Second, I 
do not think that such crossings are absolutely 
necessary for nonadiabatic behavior. This will 
be likely to occur whenever the above-mentioned 
inequality is infringed. In practice, of course, this 
will happen most easily if E is small, i.e., if the 
levels come very close to each other. 


On the Calculation of the Distribution Function 


C. W. Ufford and E. P. Wigner 


Physical Review 61, 524-527 (1942) 


(Received January 22, 1942) 


The distribution function for particles confined to a large circle is found for a repulsive 
potential of long range. The distribution function is calculated exactly from the Boltzmann- 
Gibbs equation, and compared with the solution of an implicit equation of the type used by 
Debye and Hückel in the theory of electrolytes. The solution of the Debye-Hückel equation 
agrees, for our particular potential, only fairly well with the exact distribution function. 


I. 


LTHOUGH the problem of classical statis- 
tical: mechanics is, in principle, solved by 
the Boltzman-Gibbs equation 


P(x, Yis 21, °° En; Va, Zn)dX1° $ -dz, 


=exp (—BV)dx,---dz, (1) 


for the probability of the configuration charac- 
terized by the rectangular coordinates x1, Yı, 21, 

"+, Xas Yn, Zn, the answering of questions of 
immediate physical interest meets, in most cases, 
serious mathematical difficulties. In (1), B=1/kT 
and V is the potential energy 


V=% v(|ri—rel), (1a) 
i<k 


which we shall assume to be the sum of inter- 
actions between pairs. | 

Many of the quantities of immediate physical 
importance depend on the distribution function 
g(r), i.e., on the probability of a distance r 
between, say, the particles 1 and 2. This is 
given by 


e(Irı—r2|)=const. f- . f exp (-ßV) 


Xdxsdysdzs: - -dxndyndzn. (2) 


However, one can evaluate (2) easily only in the 
case of dilute matter, i.e., gases. For condensed 
material, the integrations in (2) are so difficult 
that indirect methods had to be devised for the 
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evaluation of the distribution function. These 
methods have two features in common: (a) they 
do not give g directly but only as the solution of 
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DISTRIBUTION FUNCTION 
given by g(x’)dx’v(x—x’). Hence 
U(x) =v(x) + Í 2(x')u(x —x' )dx’ (4a) 


an implicit equation the solving of which, though 
in most cases not very easy, is still less difficult 
than the integrations of (2); and (b) the implicit 
equation for g is not exact, i.e., its solution is 
only an approximation to the rigorous expression 
(2). The first implicit equation was given by 
Debye and Hückel! in their theory of electrolytes. 
More rigorous equations are due to Kirkwood? 
and Mayer.’ 

The present authors became interested in these 
methods because integrals of the type (2) occur 
in the theory of the so-called correlation energy. 
The present note deals with a special problem in 
which all particles are confined to a large circle 
of length L; and in which the potential between 
a pair of particles is given by 


Bu(x) = — In sin? rx/L, (3) 


where x is the distance between the particles, 
measured along the circle. We found that for this 
interaction the distribution function can be 
evaluated exactly by (2), and that the implicit 
equation corresponding to Debye and Hiickel’s 
theory also can be solved. The comparison of the 
two results gives an indication of the accuracy 
of the Debye-Hiickel equation (5) for the case 
of a repulsive potential of long range. In the case 
of long range forces the Debye-Hiickel equation 
can be expected to give good results. 

In the Debye-Hiickel theory the distribution 
function is given, apart from a constant, by an 
average potential U 


In g(x) = —BU(x)+C. (4) 


The average potential contains the potential 
v(x) of the original particle around which we 
investigate the distribution of the others, and the 
average field of the other particles. There are 
g(x’ )dx’ particles at a distance x’ from the 
original particle; and their potential, at x, is 


ee and E. Hückel, Physik. ne 24, 185, 305 
(1923); P . Debye, Physik. Zeits. 25, 97 (1924). 

.G. Kirkwood as Elizabeth Monroe, J. Chem. Phys. 
ish) (1941); J. G. Kirkwood, J. Chem. Phys. 3, 300 


3J. E. Mayer and E. Montroll, J. Chem. Phys. 9, 2 
SP todd. E. Mayer and S. F. Harrison, J. Chem. Phys. 6, 


and the implicit equation becomes 
DT, f g(x ole a)dx+C. (5) 


The constant C is determined by the condition 
that the integral of g must be n—1, 


[s@de=n- 1. 


In most cases g is a constant if x is large compared 
to the average distance between neighboring 
particles. One can, therefore, substitute 


g(x) =go(1+h(x)), (6) 


in which h(x) vanishes for large x. Substitution of 
(6) into (5) gives a somewhat different form to 
the implicit equation 


In (1-+h(x)) = — Bo(x) 
— 508 | hapla- dr +C, (6a) 


(5a) 


since the integral of gw(x—x’) is independent 
of x. 


II. 


The calculations remain to be done. For 
evaluating (2) we can write 


exp (—8V) = [I sin? x(x;—x,)/L 
ick 
= 2-"(a-DITJ (exp 2rix;/L—exp 2rixı/L) 
ick 
Xexp (— wi(xs+xx)/L)|?. (7) 


The factor exp (-ri(x;+x.)/L) may be omitted 
since its absolute value is 1. If we write 


w;=exp 2rix,/L, (7a) 


we get 
exp (—6V)= 


the product on the right can be written as a 
[Aj] where 


2-1) TI (we—we)|2, (7b) 
ick 
determinant A= 


An=w,'!=exp 2rilx;/L (1=0,1,2, ---n—1). (8) 
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This gives 


exp (—BV) =2-"O-DAA*, (9) 


The determinant |Aj| is, in case of odd n, apart 
from a factor exp —7i(n—1)(xit---+x,)/L, 
the wave function of a one-dimensional de- 
generate Fermi gas. 

One can integrate (9) immediately by develop- 
ing both A and A* with respect to the two rowed 
minors of the first two rows. The minor of the 
lila columns in A is 


exp Qri(lyxitlex2)/L 


—exp 2ri(lexitdixe)/L. (10) 


Its cofactor is orthogonal to the cofactor of every 
two rowed minor of A*, excepting the cofactor of 
the minor of the l,l columns. The product of the 
cofactors of the /ıla minors of A and A* gives 
(n—2)! if integrated over x3, ++, Xn. Thus 


feof exp Carnie as, 


= ) 27" (a —2)!| exp 2rt(hixitlexe)/L 
Iı<iz 


—exp Qmi(loxi-+lyxe) /L | 2. (10a) 


Thisis, according to (2), proportional to g(x: — x2). 
The sum in (10a) can be readily evaluated and 
gives, with (5a), 


(x) “(1 sin? =) 
ua US 


n? sin? rx/L 


n sin? rnx/L 
-— (1 -— —): (11) 


a’n°’x?/L? 


The last expression is valid if x remains of the 
order of magnitude of the average distance of 
two particles, i.e., is not very much larger than 
L/n. Equation (11) gives the rigorous expression 
for g; it is the lowest curve in Fig. 1. The 
abscissae in this figure are nrx/L so that the 
average distance of two particles corresponds to 
the abscissa r in the graph. The ordinates are 
multiplied by L/n. 

That the solution of the Debye-Hiickel Eq. (5) 
is different from (11), can be seen by inserting 
(11) and (7) into the right side of (5). If g’(x) is 
the valuc of g(x) obtained by these substitutions, 
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DENSITY 
Œw 


DENSITY AVERAGE 


l 2 
Y= 7x DISTANCE AVERAGE DISTANCE 


_Fic. 1. Distribution functions. The ordinates are the 
distribution function g(x)/(#/L) representing the density 
of particles divided by the average density. The abscissae 
are y=xnx/L, = times the distance divided by the average 
distance of nearest neighbors. The distance is equal to the 
average distance for the abscissa y =r. Curve A represents 
the Apoo value of the distribution function 2) (n/L) 
given by (11) from the Boltzmann-Gibbs equation. Curve B 
represents the distribution function, g’(x) of (12a), obtained 
by T i the rigorous value (11) in the right-hand side 
of the Debye-Hückel Eq. (5). Curve C represents the 
distribution function (13) obtained by solving the Debye- 
Hücke! equation. 


this gives 
sin 2rnx/L 
In g'(x) = -——— + 2Ci(2rnx/L} (12) 
Tnx 
or 
sin 2rnx/L 
g’ (x) =exp |-—— + 2ci(aene/) | (12a) 
anx/L 


g'(x) is the uppermost curve in Fig. 1. If (11) 
were a solution of (5), the value of g’(x) in (12a) 
would be equal to the rigorous value of g(x) 
given by (11); and the uppermost curve of Fig. 1 
would coincide with the lowest curve. One sees 
that this is far from being true. The evaluation of 
the integral obtained by inserting (11) and (3) 
into (5), i.e., the derivation of (12), requires a 
somewhat lengthy calculation and will not be 
given in detail. Ci in (12) is the cosine integral as 
defined in Jahnke-Emde.‘ 

One would expect, on the basis of the lowest 
and highest curves in Fig. 1, that the solution of 
(5) would be represented by a curve similar to 
the middle one of the figure. This corresponds to 


g(x) = (13) 


n 
ie I-———) 
L ( 1-+n? sin? rx/L 


1E. Jahnke and F. Emde, Tables of Functions (B. G. 
Teubner, 1933), second edition. 
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and the following calculation shows that (13) is 
indeed the solution of (5) for the potential (3). 
In order to show this, one can first verify (5a), 
using the assumption n>>1, and then calculate 
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taken in (13e). We now have 
I= —2n In 2+1n 2?—In (1+?) 
1 fr? Iny”? 
_— f 4 dy’, (13f) 
rJ_.1+-+y"”? 


i= -6 [st Ivta- "Jia 


n pt (: 1 ) 
L oo 1+2 sin? rx'/L 


XIn sin? r(x—a’)/Ldx’. (13a) 
The first term is a well known definite integral; 
its value is —2n In 2. In order to integrate the 
second term, one can change the limits of integra- 
tion and integrate from —4Z to 4L. Then the 
first factor will be very small, except for very 
small x’ for which the sine can be replaced by its 
argument. Thus (13a) becomes 


n ci In sin? r(x—x’)/L 


I= —2n In 2-— —d'. 
L J-L 1+m?’n?x'?/L: (13b) 


If x is of the order of L, the remaining integral 
will have two maxima: at x'=0 and at x’=x. 
However, the integral over the second maximum 
is inversely proportional to n and can be neg- 
lected. The integral over the first maximum 
becomes, if one replaces x—x’ by x in the 
numerator 

I= —2n In 2—In sin? rx/L 


for <>L/n. (13c) 


If x is not large as compared with L/n, the sine 
can be replaced by its argument in (13b) since 
the denominator becomes very large for x’>L/n. 
This gives, with y=rnx’/Landy=rnx/L, 


1 pr? In (y—y’)?/n? 
I=—2nIn 2—- f ee 
wd _« i++’? 
1 pr" In (y—y’)? 
= — 2n li 2+ nm f ee (13d) 
TYI ity’? 
and 
oI 2 ¢* dy’ 2y 
M E Al 
ðy ww (ity)(y—y’) ity? 


since the main value of the integral has to be 


In the last integral the lower limit can be re- 
placed by 0 if a factor 2 is inserted. Substitution 
then of z for In y shows that the integral in (13f) 
vanishes and we have 


= — 2n In 2+1n n®—In (14+ m?n?x?/L?) 


for x&L. (13p) 


Since n>1, 


I= -6 [sed = —2nIn2 
+In n?—In (1+? sin? rx/L) (14) 


holds for both regions (13c) and (13g). Insertion 
of (13), (14), and (3) into (5) shows that (5) is 
indeed satished by (13). C in (5) has the value 
In 2?*z/L. The solution (13) of (5) is shown in 
Fig. 1 lying for small abscissae between the other 
two curves. 


III. 


One sees that the solution of the Debye: 
Hiickel equation agrees fairly well with the 
correct distribution function if the potential is 
given by (3). In particular, the total volume of 
the “hole” in the distribution function is 1 in 
both cases. Nevertheless, (5) gives too high a 
probability of a close approach of two particles 
since (13) is too high for small abscissae. The 
failure of (5) in this respect could have been 
foreseen and depends only on the repul- 
sive character of our potential (3). What is 
essentially neglected in (5) is that-the particle 
under consideration, particle 2 in our notation, 
has an effect on the rest of the particles. In our 
case, it pushes them away from itself so that the 
distribution of the particles 3, 4, ++», n will, in 
reality, show a hole around the point where 
particle 2 is. Thus particle 2 will be under a 
lower potential than the average potential U 


given by (4a). However, the difference between, 


the actual and the average potential is smallest 
for small x because, where the second particle is 
close to the first, there are very few particles to 
be shoved away from its neighborhood. 


Electromagnetic Field Expansions 
in Loss-Free Cavities Excited Through Holes” 


T. Teichmann and E.P. Wigner! 


Journal of Applied Physics 24, 262-267 (1953) 


(Received April 30, 1952) 


It is shown that the electromagnetic field in a loss-free cavity excited through holes cannot be completely 
expressed in terms of the short-circuit modes of the cavity satisfying the condition that the tangential com- 
ponent of the electric field is zero on the boundary of the cavity including the openings. For a complete 
expansion it is necessary to add an irrotational magnetic field, which contributes a term inversely propor- 
tional to the frequency, to the usual admittance matrix. If the cavity is presumed to include a reasonable 
portion of the guides feeding the openings, this irrotational component becomes almost diagonal. 


I. INTRODUCTION 


i the course of comparing some similar aspects of the 
theory of nuclear reactions and of electromagnetic 
transmission through loss-free microwave junctions, the 
writers were led to a re-examination of the problem of 
expanding arbitrary electromagnetic fields in these junc- 
tions in terms of normal modes. 

The case of a completely closed cavity with perfectly 
conducting walls was disposed of by Weyl! many 
years ago. Using the theory of linear integral equations, 
he showed that an arbitrary electromagnetic field satis- 
fying homogeneous short-circuit boundary conditions 
on the perfectly conducting surface S of a hollow simply 
connected cavity V (i.e., a field E, H satisfying nx E=0 
on S, n being the unit normal to S) can be completely 
expanded in terms of the normal short-circuit modes of 
the cavity. These modes satisfy the equations 


curlE. em jwala 


in V 1.1 
curlH.= Jaba (1.1) 

* This work was assisted in part by the U. S. Atomic Energy 
Commission and the Higgins Scientific Trust Fund. 

t Now at the Research and Development Laboratories, Hughes 
Aircraft Company, Culver City, California. 

1H, Weyl, J. Math. 143, 177 (1913). 

2 H. Weyl, rend. circ. mat. Palermo 39, 1 (1916). 


and the boundary condition 
nXE,=0 onS. (1.2) 


Throughout this discussion one is, of course, dealing 
with the temporal Fourier transforms of the actual 
fields (j=./—1). 

Weyl attacked this problem mainly to study the 
asymptotic distribution of proper frequencies. In recent 
years the expansion aspect of the problem has acquired 
new interest, because of the large part played by 
cavities (with holes, however) in modern microwave 
system,’4 and also because of attempts to describe 
stellar electromagnetic fields in terms of normal modes.® 

In the first of these problems, which is the one of 
interest here, it usually proves convenient to separate 
the effect of the guides and of the junction proper 
(see Fig. 1). The field in the guides is characterized 
by the various guide modes, while in the junction proper 
it must be characterized by some compleie set of vectors. 
The junction couples the various modes and guides, and 
the generalized “currents” and “voltages” in the guides 
are then related by the so-called admittance matrix, 

8 E. U. Condon, Revs. Modern Phys. 14, 341 (1942); J. C. Slater, 
Revs. Modern Phys. 18, 441 (1946). 

1 K. Franz, Elek. Nachr.-Tech. 21, 8 (1944). 


5 W. Elsasser, “Boundary-Value Problems of Hydromagnetics” 
Tech. Rept. ONR Contract N-7-onr-388 (Sept. 1950). 
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which is a function of frequency in which the proper 
frequencies of the junction play a basic role.?® It seems 
to have been generally assumed that the short-circuit 
modes of the junction, regarded as a closed cavity, 
form a complete set with respect to an arbitrary admis- 
sible electromagnetic field (excited by the wave-guide 
fields through the openings).. While these modes can 
indeed be made basic to any such expansion, they are 
not complete. Instead, it is necessary to add an irrota- 
tional magnetic field; similarly, in the case of open- 
circuit modes (i.e., modes satisfying the boundary 
condition nX H=0 on S) it is necessary to add an irro- 
tational electric field in order to permit expansion of the 
relevant fields. Since these facts do not appear to be 
widely known, this paper is intended to elucidate them 
and to estimate the effect of the additional field on the 
admittance matrix. 


II. CHARACTERISTIC FIELDS OF THE CAVITY 


The problem one has to consider is how to char- 
acterize the electromagnetic field adequately in a 
hollow simply connected cavity with perfectly con- 
ducting walls, which is excited via holes out in these 
walls. 

The simplest way to attack this is to introduce a set 
of orthogonal vectors (in Hilbert space) and add further 
such vectors until the set is complete, i.e., permits an 
expansion of all the fields of interest. The natural 
starting point is, thus, the set of normal vectors, satis- 
fying short-circuit boundary conditions on the entire 
“surface” S of the cavity, consisting of the conducting 
portion S!, and the holes (wave-guide openings, hence- 
forth called terminal planes) Tı, T2, ---, assumed to 
be plane. The total open area is denoted by T= {m 
Tu=T\+T.+:::. These normal modes thus satisfy 


curlE. = — jwoHe 21 
curlH.= jw«Ea ay 


in the cavity V, and the boundary condition 


nXE,=0 (2.2) 


on S. The wa are real, and there is no loss of generality 
in taking Ea real, and Ha imaginary. Ea, Hae may be 
normalized to satisfy 


f (E.-E,—He-H,)dV = ben. (2.3) 
vV 


The solutions occur in pairs we, Ea, Ha, and wa = —wa, 
E_.=E., H_.= — Ha. It will generally prove conven- 
ient to combine these. 

It is clear at this stage where the difficulty may arise. 
The above equations assert that the Ea are the proper 


A comprehensive treatment of the background material is 
given by J. C. Slater, reference 3. A- treatment giving the notation 
and some of the results stated below in more detail may also be 
found in T. Teichmann, J. Appl. Phys. 23, 701 (1952). 


vectors of the boundary value problem 
AE.+waE.=0 (2.4) 


with the boundary condition (2.2). If w.20 this implies 
that the H. are the proper vectors of the problem 


AH,.+ w7H.=0, (2.5) 
with the boundary condition 
(n-H.)=0 onS. (2.6) 


This last condition follows from Eqs. (2.1) and (2.2) if 
wa 0, but not if w.=0. In this latter case, neither (2.6) 
nor the vanishing of the divergences of H and E follows 
from (2.1) and (2.2). However, since their curls vanish, 
they can be represented as gradients of scalars, 


(2.7) 


The zero-frequency fields (2.7) appear to lack physical 
reality, and it is customary to assume that they are not 
needed for the expansion of the actual field E, H in the 
cavity. That this is not so will be demonstrated in two 
steps. We shall show, first, that the Es, Hg of (2.7) is 
orthogonal in the sense of (2.3) to every Ea, Ha field 
with wa70, as long as w assumes a constant value on the 
surface S of the cavity (u can be arbitrary). The second 
step will show that a 


E,=0 Hs=gradu 


Es=gradw H,=gradu. 


(2.8) 


field is, in general, not orthogonal to the actual field, 
although a 


E,=gradw He=0 (2.9) 


field is. Hence, the expansion of the actual field will in 
general contain a term of the character (2.8). | 

The first step will be carried out first for the field 
(2.8). We have 


f H,-H.dV=— f H,-curlE.dV 
We 


=] f div(E.XH,)dV-+ f Ea: curlHgay |. (2.10) 
Wa 


However, the second term of this vanishes because Hg 
is a gradient, and the first term is equal to the surface 
integral of 


(EX Hs) -n= (1X Ea): Ha, (2.11) 


which vanishes by (2.2). A similar calculation for 
S Es: EdV leads to the surface integral of (HX Ep) -n 
which will vanish only if (HeX Eps) is parallel to the 
surface S. Since Ha can be any vector parallel to S, 
(H. X Es) is parallel to the surface S. Since Ha can be 
any vector parallel to S, (H.XxE;) will be parallel to 
S only if E, is perpendicular thereto, i.e., if w is constant 
on S. The condition on E, is, therefore, the same (2.2) 
as on E.. 
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The second step of our demonstration is very similar 
to the first one, with the difference, however, that the 
actual field E, H replaces Ea, Ha. We can assume, with- 
out loss of generality, that the actual field E, H is 
periodic with a finite period w (this excludes static 
fields), this then replaces we in (2.10). The transforma- 
tion (2.11) finally gives 


f H,- Hdv =} f (nxE)-Hsds. (2.12) 
v es 


This is not zero in general because nXE vanishes only 
on the conducting portion S? of S, not on Ti, Tz, -+-.On 
the other hand, a similar calculation with the field 
(2.9) leads to the surface integral of 


— (naX H). Es= (nX Eg) -H, (2.13) 


and this vanishes because Eg is perpendicular to S. 

The component (2.8) is most significant in the de- 
scription of the actual field if the linear dimensions of 
the cavity are small as compared with the wavelength 
of the radiation which one wishes to describe, i.e., for 
which one wishes to obtain the admittance. In general, 
the significance of the static component (2.9) can be 
best gauged by means of the final formula and this will 
be done following (3.35). Naturally, no (0, gradu) term 
would be necessary if one wanted to expand only fields 
which can exist in the cavity S the walls of which are 
actually all conducting. However, this is not the object 
of the admittance matrix. 

One could avoid using the static portion (0, gradu) 
by replacing the boundary condition (2.2) by an im- 
pedance boundary condition relating the tangential 
components of E, and Ha, viz., 


(nXE.)= U(nX Ha), (2.14) 


where N is a two-dimensional dyadic’ with zero trace 
(to insure orthogonality). For X0, the boundary 
value problem thus defined has a double infinity of 
proper solutions (as compared to the case A=0), be- 
cause the boundary conditions have, as it were, an 
additional degree of freedom. In the limit 2{=0, these 
normal modes coalesce in pairs to give the normal 
modes of the original problem with short-circuited 
boundary conditions. The field gradu is thus required 
to make up for this deflection of half the modes. How- 
ever, this method appears rather artificial. | 


II. EFFECT OF THE IRROTATIONAL COMPONENT 


In light of the considerations of the previous section, 
the electromagnetic field E, H, excited in the cavity 


7 We have been informed that boundary conditions of this na- 
ture have been considered already by J. Schwinger in an as yet 
unpublished article. 
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may be represented by the series 
E= 2 J alias 
(3.1) 


H=} faHa+gradu. 


If this field is of frequency w, E, H satisfy the equations 
culE=—jwH divE=0, 


curlH = jwE divH=0. (3:2) 
Thus, 
div(EXH.-E.xXH)= j(w—w.)(E-E.—H-H.) (3.3) 


and application of-Gauss’ theorem, and the orthonor- 
mality relations (2.3) leads to the result 


j(wa—) fa= Í H.a- (EXn)dT (3.4) 


[since (EXn)=0 on S']. The fa are therefore completely 
determined by the frequency and the transverse com- 
ponent of E on the terminal planes. To determine u, 
one notes that Au=0, so that u is completely deter- 
mined by the value of its normal derivative over S. 
Since n- gradu=n-H on S, one has 


u(p) = f N(b,g)(n-H)(9)aT(q), 8.5) 
T 


where p, q are points of V or S, and N(2, q) is the Neu- 
mann function? of the cavity, determined as follows: 


(i) N(p, 9) = (4rr p) +0, q) 
with v(p, q) regular and harmonic (i.e. Av=0) in V; 


a) f wo gas@=0; 
S 
ƏN(p,g) 1 en 
(iii) ee. eas 
IN, S 


In the last equation S denotes the area of S. 

In order to estimate the relative contribution of the 
irrotational component, one proceeds most conveniently 
by expressing E and H on the terminal planes Ty in the 
“canonical” form, i.e., in terms of the electric and mag- 
netic modes of the wave guide, which has the terminal 
plane in question as cross section. Thus one writes. 


E= > 2 UveF .netineZ Nels, Nes 


(3.7) 
H= > 2 in:Gr, NetineZve Gz, Ns 


8S. Bergman and M. Schiffer, Duke Math. J. 14, 609 (1947). 
This article also contains an account of various properties of these 
Neumann functions which are used below. See also O. D. Kellog, 
Foundations of Potential Theory (J. Springer, Berlin, 1929), 
Chap. IX. Neumann’s function is also called Green’s function 
of the second kind. 
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Fy., Gy, are the electric and magnetic fields of the 
sth modes in the Nth guide (i.e., on the Nth terminal 
plane or “hole” T y), and the suffixes /, z denote the trans- 
verse and longitudinal components of these fields (with 
respect to the normal Ty). Zn, is the impedance of this 
mode. A full description of these modes may be found 
in Condon’s work (reference 3); the properties relevant 
to the present discussion are listed below. 

When we introduce the scalar solutions Uy, of the 


equation 
(grad, w)*Uws+Kv2Un.=0 (3.8) 


on the terminal plane Ty, subject to the boundary 
conditions 


(grad; vUns)n=0 sodd, s=1, 3,--- (3.8m) 
Un,=0 seven,s=2,4,--- (3.8e) 
on the boundary cy of Ty, and placing 
ky? =w — Ky,’, (3.9) 

the wave-guide modes are given as follows. 
Magnetic modes, s odd 

G; v.= grad, Uy, 

Ga va= —(GKwekv Unsen 

F, va= (en XG: ys) 

F,n.=0 


Zn wm”. (3.11m) 


(3.10m) 


Electric modes, s even 
F, v= grad, vU ya 
F, ve= - (jKndkns!)Unsen 
Gi va= (F, y. Xen) 
G,n.=0 


Zus = Ian. 


(3.10e) 


(3.11e) 


en denotes the unit normal to the terminal plane Ty. 
It is directed away from the cavity and is equal ton on 
Ty. Fy, and Gy, are defined to be zero on all parts of T 
except Ty, where they are given by the previous equa- 
tions; hence they satisfy the orthonormality relations 


Five Fe udT1 


TL 


Í Gine GumdTı= 
Ty 


= | (Fin.XG, mw,)-e1dT,z (3.12) 
TL 


= ÔLMÔN More, 


the Uy, being normalized by 


(grad, nUy.)*dTy=Kne 


TN TN 


UnsdTy=1. (3.13) 


The Uy, may be taken to be real without introducing 
any essential restriction. 

Putting the expression (3.7) for E into Eq. (3.4), and 
using the above results one finds that 


fa= 2, L YaNsUNo (3.14) 
Wo w N isa 
where 
Ven=—j f H.-G.ndTy (8415) 
TN 


is real and independent of frequency. Similarly, com- 
bining Eqs. (3.7) and (3.5) yields 


NG, g)Uwalgd Tn o| 
(3.16) 


oe > 2 tw Kri 


TN 


2, means that the sum is to be taken only over odd 
values of s, since only the magnetic modes contribute 
to the normal component of H at the terminal planes. 
The factors 


ES f Neo Ur (DATO | 


are independent of the frequency w, so that for fixed 
values of the voltages vy., u(p) varies as WI. 

Using these values for the f. and u(p) in the expan- 
sion (3.1) of H and comparing this with the expression 
(3.7) which holds for the same quantity on the terminal 
planes Ty, it follows simply that 


H-G, mdTu=imr 


lm 
=, > (A Mr, netCmr,Ns)UNe (3.17) 
N a 
=} > YM, NIN» 
N 6 
where 
P YaMrYaNs f YaMrYaNa 
A MrNe= Jj 2 —— = 2jw L —. (3.18) 
a Wamu a>0 War mM u 


The last part follows from the remark following Eq. 
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(2.3). Similarly, 
Gmr, Ns = — jo Kye 


xX aT u(p) grad, mU mhp) $ grad, M 


TM 


X f cTn()N(d, )Un.lg) (3.19) 


TN 


for r, s both odd, and zero otherwise. A direct applica- 
tion of Green’s theorem enables Cmr, ye to be written in 
the more symmetrical form 


Cur, N= jo Kur Kyne 
x f dT x(p)dT x(g) 
Tu’ Ty 


XU u(p)N(p, 9) Un.(q). (3.20) 


In matrix notation one may therefore write the equation 


i= Yr, (3.21) 
in which the admittance matrixt 
Y=A+C (3.22) 


is composed of two parts, the contribution A of the 
cavity short-circuit modes and the purely inductive 
(irrotational) part C. 

Before proceeding to estimate the matrix elements of 
C and to compare them with those of A, it is instructive 
to show how these two terms contribute to the total 


energy 
Wu= f IH]2av. (3.23) 
Vv 
From Eq. (3.1) and the results of Sec. II, 
W=% fal f | gradu|?d V. (3.24) 
a v 


When Eqs. (3.14) and (3.18) are used the sum on the 
right is given simply by 


= >> > 3 „2 vv,Ans, MrUMr*s 
N e Mr 


or in matrix notation 


> | fa|?= — juAr*. (3.25) 


The dot denotes differentiation with respect to w, 
f=df/dw, etc. The contribution of the irrotational 


ł Because the transverse components of the wave-guide modes 
have been defined to be independent of frequency, the admittance 
matrix here derived does not contain the inconvenient factors Z? 
which occur in reference 6 and introduce an unnecessary additional 
frequency dependence which is quite independent of the proper- 
ties of the cavity. 
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part is 
f | gradu|2dV 

Y 


=y? 2 ee > > wv KndKur 


aTx(g)N(p, 9)Uwe(g)- grad 
TN 


x f dT u(y)N(, Uns). 
TM 


xf dV (p) grad 


It is a quite simple consequence of potential theory that 


f dV(p) gradN(p, g)-gradN(p, y)=N(q, y) 


(see reference 8, for example), and hence the expression 
on the right reduces to 


=)%L >> 2 2 UNsC Ns, MrUMr*, 
Ns Mr 
or, in matrix notation 


f |gradu|?dV = — joCo*. (3.26) 
14 
Consequently, 
W= — jo(A+C)o*. (3.27) 


If the excitation of the cavity is such that there is no 
normal magnetic field over the terminal planes C will 
not contribute anything to W, because the vs for 
which- C is finite are then zero. 

In order to calculate C m», n, exactly it is necessary to 
know the Neumann function N(d,g) of the cavity, 
and to carry out the double integration on the right of 
Eq. (3.20). Both the determination of N(Ż, q) and the 
subsequent integration may generally be expected to be 
rather laborious. However, if suitable assumptions 
are made about the position of the terminal planes 
Ty, Cmr, Nne may be estimated without too much diff- 
culty, and the approximate form turns out to be rather. 
simple. 

One therefore supposes that the terminal planes Ty 
are displaced outward by an amount dy from the 
“natural” boundary of the cavity, along the wave guides 
which they generate. The cavity now consists of the 
“natural” cavity plus sections of guide of length dy 
(see Fig. 1). The calculation now rests on the fact that 


N(p, )Uw.(Q)dTn(Q) 


TN 


one(p) = (3.28) 


is the potential at the point » due to a normal field 
Un.(g) over Ty, and zero elsewhere on S. In the region 
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between the natural boundary and Ty, it is possible to 
determine $x,(P) directly, by using the fact that the 
Un, form a complete set of solutions of the boundary 
value problem [given by Eqs. (3.8) and (3.8m) ], and the 
well-known potential theoretic results that-¢y.(p) can- 
not have any maxima or minima in V, but only at the 
boundary. A simple calculation then yields the result 
that 


nip) = Ku (1— One EN dny 


X (ENN Oy ,e?KNedNgKNneN)U,,, (3.29) 


Here zy denotes the distance of p from the terminal 


plane Ty. @y, is a constant of absolute magnitude less 
than unity, 


|@v.| <1. (3.30) 


It then follows trom the formulas (3.20) and the ortho- 
normality relations (3.12) and (3.13) that 


j 
Cyr, N” —-Kyör : (1 _ One ?EN:an)-1 


u) 
X(1+0y,e?ENin), (3.31) 
It is clear that if dy is made sufficiently large 
J 
Cnr, Ne= — -K Nara. (3.32) 


w 


The extension to matrix elements Cm, n, with MN is 
now also plain. If Curr, vs’ represent the matrix elements 
of C when the terminal planes Ty coincide with the 
natural boundary, then (3.29) shows that 


Op—(K MrdaM+KNedy) | 


Cmr, NC ur, Ns (3.33) 
For large dy and dy these matrix elements also become 
negligible. Hence, if the cavity is defined to include a 
reasonable length of each entering guide, one has 


(3.34) 


J 
Cmr, Ne -Knsömnör:. 
U) 


Since C has a frequency dependence of the form 1/w, 
while A has an infinity of poles extending to infinity, 
it follows that C can contribute appreciably to Y only 
for low frequencies (small w). This is, of course, con- 
sistent with the interpretation of C as the manifestation 
of the quasi-static part of H. 

The above considerations show that the usual ad- 
mittance matrix 
YaMrYaNs 


A mr, Ne=2jw 2 


a>0 Wa — a 


must be augmented by the additional “irrotational” 
(i.e., purely inductive) matrix Cmr, ve, given by formula 
(3.20). If the cavity is chosen so as to include a portion 
of each wave guide larger than the diameter, then 
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Fic. 1. Sketch of cavity showing terminal planes 
and natural boundary. 


Curx.s becomes approximately diagonal and has the 
form (3.34). 

One may obtain a rough estimate of the relative 
contributions of the two terms by using the asymptotic 
relation between the yay, and the we derived in 
reference 6, viz., 


T<Yanr>—Da, (3.35) 


where < Yani’ > denotes an average of yan.” over several 
values of a, and Da the average spacing between the wa 
in this region. Applying this result to all the individual 
terms a, except the lowest a=1, and supposing that 
ww , one finds (see reference 6 for details) 


2jwyine j wı 
Ans NOt 1g( 142). (3.36) 
u r D; 
Thus, the nonresonant contribution of ÁN, N: for 
ww, is approximately equal to j/r, while Cn. Na con- 
tributes — jKn./wı which is of the order — j VT y`? 
for the lowest wave-guide mode s, Ty being the area of 
the Nth terminal plane. This latter term will be larger 
than the nonresonant part of Á w., y, except if the cavity 
is very shallow in the direction of the guide. Since the 
nonresonant part of Y at any resonance (pole) wa serves 
only to change the width and position of the transmis- 
sion resonance slightly, one may say that at the lowest 
resonance w, these changes are determined largely by 
the inductive component C; at the higher modes, C 
becomes small because of its 1/w frequency dependence, 
and the changes in width and resonance position are 
then due mainly to the cumulative effect of the other 
(nonresonant) terms of A (see also reference 6, Sec. III). 
We are very much indebted to Professor R. H. Dicke 
for many valuable discussions. 
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Multiple scattering can be regarded as a succession of elementary events. The distribution function for 
the particles which have gone through n+1 elementary events is the convolution of two functions. The first 
of these expresses the scattering law; the second one is the distribution function for particles which have 
gone through » events. It is well known that such convolutions can be calculated very easily by means of 
Fourier transforms if the elementary event is the traversal of a free path in an arbitrary direction. In this 
case, the Fourier transform of the convolution is the product of the Fourier transforms of the convolvents. 
In the case of more general scattering laws, integrals over products of the distribution function, and of repre- 
sensations of the group which leaves the scattering law invariant, play the same role which the Fourier 
transforms play in the aforementioned case. From the present point of view, the exponential in the Fourier 
transform is a representation of the displacement group. It is shown that one can solve several problems 
of multiple scattering on the basis of the above observation. These problems include the scattering of a 
point particle without change of energy but an arbitrary angular distribution, and several more involved 


problems. 


INTRODUCTION 


ROSJEAN! has given, by direct calculation, a 
solution of the problem of multiple scattering in 
an infinite medium. The reason for taking up the same 
problem again is that the method of calculation to be 
presented is somewhat more transparent and also appli- 
cable to a wider range of problems. It is based on the 
symmetry of the problem considered and uses the 
theory of group representations. 

By multiple scattering we mean a succession of 
“elementary events” which change the state of the 
system and which are statistically independent of each 
other. In the theory of multiple scattering, the ele- 
mentary event is, in general, a collision and the sub- 
sequent traversal of a free path. It changes the state 
of the system from the one in which it is before a col- 
lision to the state in which it is before the next one. The 
statistical independence of elementary events will be 
guaranteed if the description of the state of the system 
is complete, i.e., if it extends to all the parameters. In 
many cases, this is not necessary. Thus, if one is 
interested only in the velocity distribution after a 
certain number of collisions, and if the medium in 
which the particle moves is homogeneous and extends 
over all space, one can suppress the position coor- 
dinates in the description of the state and consider, as 
elementary events, the changes in the velocity vector 
caused by the subsequent collisions. It is clear that the 


1C. C. Grosjean, dissertation, Columbia University, 1951 (un- 
published). This thesis also has rather extensive references to 
earlier literature. Among earlier papers, those of W. Bothe 
[Z. Physik 54, 161 (1929) ] and of S. Goudsmit and J. L. Saunder- 
son [Phys. Rev. 54, 773 (1939) and 58, 36 (1940)] anticipate 
most nearly Grosjean’s results. The last article solves, in par- 
ticular, the same problem which is treated by Eqs. (16) of the 
present note. Grosjean’s article goes further than these by being 
able to give a rigorous expression for the probability of a given 
displacement, not only for the probability of a given deflection. 
This corresponds to Eqs. (17) of the present note. For later 
developments of Grosjean’s method, see also C. C. Grosjean, 
Koninkl. Vlaam. Acad. Wetenschap. Letter en Shone Kunsten 
Belgié Jaarboek 13, No. 36 (1951) and Physica 19, 29 (1953). 
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statistical independence of subsequent elementary 
events is preserved under the conditions specified even 
though the description of the state is incomplete in this 
case.: The same holds in the case of spherically sym- 
metric scattering with respect to the direction of the 
velocity if one is interested only in the density dis- 
tribution of the particles, irrespective of the direction 
of their velocities. In the most important case of 
multiple scattering, in which a complete specification 
of the state of the system is necessary, the elementary 
act can contain, just as well, a free path with a sub- 
sequent collision. In this case, the elementary event 
changes the state from the one after a collision to the 
state after the next collision. 

The condition under which we can give an explicit 
solution of the problem is as follows. (1) The probability 
that an elementary event changes the state in a certain 
way is invariant under a group G. (2) Every state of 
the system is obtainable from a single fixed state by the 
operations of the group G. In the case of multiple scat- 
tering in an infinite homogeneous and isotropic medium, 
the group G contains all rotations and displacements in 
space, i.e., is the Euclidean group. The conditions of 
homogeneity and isotropy of the scattering medium 
express the fact that the scattering law, i.e., the law of 
the elementary event, is invariant under G. If the 
moving particle has no structure and does not change 
its energy as a result of the collisions, its state is com- 
pletely characterized by its position and the direction 
of its velocity. Hence, every state of the particle can 
be obtained from a standard one by a displacement and 
a rotation, i.e., by an element of G. As standard state 
one can choose, for instance, the one in which the par- 
ticle is at the origin of the coordinate system and its 
velocity is parallel to the Z axis. In the second case 
mentioned in the preceding paragraph, in which one is 
interested only in the velocity distribution, the scat- 
tering law will have the symmetry of the rotation 
group and G will be this group. If the collision does not 


18 E. 


change the energy of the moving particle, its state is 

given by the direction of the velocity vector and any 
such direction can be obtained from a standard one by 

rotation, i.e., by an element of G. Hence, our method» 
will be applicable to both these cases and a similar dis- 

cussion shows that it is applicable also in the last case 

mentioned above, —that of spherically symmetric scat- 

tering if the particle again does not change its energy in 

the course of the collisions. Further and more general 

examples will be given later. 


EQUATIONS OF MULTIPLE SCATTERING 
A. 


Even though the method of solution to be adopted 
will be essentially the same in all cases to be considered, 
it seems worth while to distinguish two cases at this 
point. In the first case, there is only one group element 
which carries the standard state e of the particle into a 
given state s. In this case, the group element in question 
can be denoted by s itself, the unit element e corre- 
sponding to the standard state. This is the situation if 
the particle is restricted to move in a plane, no matter 
whether its “state” has to be characterized only by the 
direction of its velocity, or by this direction and the 
position of the particle. Let us denote in this case the 
probability distribution (per unit volume element of 
the invariant group space) after n elementary events by 
fn(s). The volume element is simply dy, with p the 
angle between direction of the velocity and the standard 
direction, if one excludes the consideration of the posi- 
tion of the particle. It is dgdxdy, with x and y the 
rectangular coordinates of the particle, if one is inter- 
ested also in the probability of the position of the 
particle. If the energy also changes in the course of 
collisions, it becomes a fourth parameter of s and the 
volume element will depend on the energy dependence 
of the law of scattering. 

The probability that an elementary event change 
the state s into a unit volume element at ¢ will be de- 
noted by P(s,t). The invariance of this probability under 
the operations of the group means that for every group 


element u 
P(s,)=P(us,ul). (1) 


This equation already uses our assumption that there 
is a correspondance between states and the elements of 
the group which leaves the law of the elementary event 
invariant. Writing, in particular, «=s~' in (1), one 


obtains 
P(s,) = P (e,t) = P(t). (1a) 


This equation expresses the fact that the scattering law 
remains the same no matter whether one uses e or s 
as the basic state. The probability distribution /»+1(2) 
is given therefore by 


Fe f Plsdds= f PlsWds, (2) 
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where ds signifies the invariant group integration. 
(Right and left invariant group integrals are identical 
in all the groups to be considered.) It is very natural 
that the probability distribution after »+1 collisions 
be given by the convolution of the probability dis- 
tribution after » collisions and the scattering law. 

Because of the somewhat abstract nature of the 
derivation of (2), it may be worth while to illustrate 
it on the aforementioned examples. If, in two-dimen- 
sional scattering, the position of the particle is sup- 
pressed, the group G is simply the group of rotations in 
two dimensions and can be characterized by an angle g. 
The state (go) of the particle is obtained from the 
standard state, with the velocity directed parallel to 
the X axis, by a rotation with go; i.e., it is the state in 
which the velocity includes an angle po with the X axis. 
If ¢ and s in (2) are characterized by the angles £ and 
yo this equation becomes 


fanl) = f falo) Plo- v0)d yo, (3) 


since the angle which characterizes the group element 
sli becomes y— go. According to its definition, 


P(g')=0lp')/0, 


where e= fo(¢’)dg’ is the total cross section. Hence, 
(3) gives the usual way to calculate the angular distri- 
butions successively. 

If one wishes to obtain the spatial distribution of the 
particle as well as its velocity, the group G becomes the 
group of motions in two dimensions. It can be charac- 
terized by a displacement by x, y and a rotation by ¢ 
(the latter preceding the former). The group element, 


(3a) 


COSYo —SINGO Xo 10 x 
s= | singo coSgo Yol=|0 1 yo 
0 1 001 

coSsp —singo O 

-singo CoSgo Ol, 

0 0 1 


characterizes the state in which the particle is at xo, yo 
and the direction of the velocity includes an angle po 
with the X axis. A direct method to obtain P in terms 
of the cross section will be given in the section Calcu- 
lation of P(s,t). In the present case we wish to verify 
only a posteriori that (2) gives the well-known equation 
(3e) if 

P(x,y, g) = o(p) erà (y— e), (3b) 
wherer and y are abbreviations for the polar coordinates 
of x’=rcosy and y’=rsiny. The ô function in (3b) 
expresses the fact that the displacement is always in 
the direction of the velocity. For the verification of (2), 
let us denote the parameters of the group element s by 
20, Yo, Yo, those of t by x, y, p; the parameters of si 
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then become 


x’ = (x— xo) cospot+ (y— yo) singe, 
y'= — (x— xo) singot (y— yo) cos¢o, 
¢ = o— po. 


The polar coordinates of x’, y’ are r and Y=a— po, where 
r and æ are polar coordinates for x—x9=r cosa, 
y—yo=r Sina. Hence 


P(x y'o) =0(~— go)re-*"5(a— g), 


and with the invariant volume element for ds given 
above, (2) becomes 


f mlego)= f f f dxodyod pof n(%0,¥0, Po) 


Xa (p— po)r e"6(a— p). (3d) 


This becomes the well-known equation, 


In+1(2,9,9)= f f f drdad pofa (&0,90,%0) 
Xolp— po) Ts la— p) 
= f f drrpofn(&—r cosy, y—r sing, po) 
(3e) 


if one replaces the integration variables xo, yo by r and 
a and carries out the integration over a. It may be 
arguable that the well-known (3e) is simpler than (2) 


Xo(p- Ye”, 


but we shall see that (2) suggests more directly a 


method of solution than does (3e). 


B. 


The preceding calculation applies in the case in which 
G has only one element which transforms the standard 
state e into a given state s. This is not true if, for 
instance, a particle without structure moves in three- 
rather than two-dimensional space. The reason therefore 
is that there are rotations—those about the direction 
of the velocity in the standard state—which leave the 
standard state unchanged. The rest of the present 
section contains a discussion of the group theoretic 
description of the state and of the scattering law. While 
the former discussion is essential, the discussion fol- 
lowing (5a) is not necessary if one is interested only in 
using the present method. The result of this discussion 
is (9) which becomes evident at any rate if one cal- 
culates P explicitly. 

Let us denote the number of parameters which char- 
acterize the states by nı, the number of parameters of 
the group G by n. There will be then an n—n, para- 
metric subgroup E, the elements of which carry the 
standard state & into itself: 


e,6= & if e, contained in E. (4) 


(3c) 


Two group elements, s and s’, will carry & into the 
same state $ if 

S=sé=5'§; E=s—s'&. (4a) 
s^s’ is an element of the subgroup Æ, i.e., if s and s’ 
are in the same left coset of E. One can say that in the 
more general case now considered the left cosets of E 
correspond to the different states of the particle rather 
than the group elements themselves. 

We shall see that expressions of the form (2) are very 
easily evaluated and we wish to define, therefore, also 
in the present case a probability function f(s), depend- 
ing on the group elements rather than on left cosets. 
This f(s), if integrated over the elements of the cosets 
which correspond to a set of states, will give the prob- 
ability of the states of the set. In order to express this 
analytically, it is useful to introduce two types of 
parameters, ¢ and e, for the group G. There are nı 
parameters ¢ and they have the same value for all 
elements of a left coset and serve to distinguish these 
cosets, i.e., to characterize the states of the particle. 
There are n— n parameters e and they distinguish the 
various elements of the left cosets. One can choose, for 
instance, the parameters f and e in such a way that 
the element with parameters ¢ and e become 


s(F,<)=s(F,0)e(€), 


where e(e) are the elements of the subgroup E and 
s(t,0) is a continuous function of S(f). If we denote the 
probability of the states S({) per unit range of ¢ by 
F(t), we shall demand of f(s) 


(4b) 


F=f fede (5) 


where the integration over e is to be extended over the 
subgroup E, g(f,e) is the weight factor which makes 
g(f,e)d¢de the invariant integral, and s is the group 
element with the parameters f, e. This equation ex- 
presses the postulate that the integral of / over a coset 
give the probability of the state to which the coset 
corresponds. 

Clearly, (5) does not determine /(s) completely and 
we can further postulate that it have the same value 
for each element of a coset, i.e., be a constant along the 
path of integration of (5) 


f(s) =f(se1) if e; contained in Æ. (5a) 


All our probability functions shal] have the property 
(Sa). 

In a similar way, we shall try to replace the transition 
probability P(S,82)=P(o1,f2) by a function P(s,t) of 
the group elements s,/. This has to be done in such a 
way that if 


Fasi(ta)= f Fat P(tutaddts, (6) 
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and if F, and f„ correspond to each other in the sense 
of (5), (5a), then 


fae f falss)P(sa,ss)ds, (6a) 


also correspond to F„+ı in the same sense, no matter 
what the functional form of F, is. Since /ntı, as a 
probability function, will have to satisfy (5a), we can 
conclude at once that 


P (5,52) = P (S1,52€2) if e2 contained in E. 


(7a) 


Expressing now both F in (6) by the corresponding f, 
we find 


f Inrı(s2)g (F2,€2)de2= f f Fa(sidg (G1, 61) P (F1,62)dedts 


= f falsPCrta)ds1, 


where sz and s, are the group elements with the param- 
eters t26: and fı,eı, respectively, and the second line 
follows from the definition of g(¢,e)d¢de as the invariant 
group integral. The condition that the fn+ obtained 
from (6) satisfy (6a) is, therefore, 


f ds, f SOP Gtr O f dss fa(s:)P (tas). 


Since f, is not an arbitrary function of sı but satisfies 
(5a), the last equation does not fully determine P(s1,59) 
and we are free to let it depend on e, in an arbitrary 
fashion. The simplest choice is 


J PGusdeGneddea= peu, (7) 


according to which we have, in addition to (7a), 
(7b) 


According to (7a) and (7b), P(s,s)=P(si',s) if sı 
and s;’ are in the same left coset of E and the same 
holds of s2, sz’. This is indeed the most natural con- 
vention; it renders all quantities f(s), P (s, functions 
only of the ¢ parameters of their (group element) 
variables. It follows that P(s,,52) can be taken out of 
the integral sign of (7) and this equation, together with 
(7a), (7b), completely determines P(s,,52) once P(¢1,¢2) 
is given. 

The invariance of the law governing the elementary 
event can be expressed in the following way. Let us 
assume that the distribution will be given by fn41(se) if 
it was /„(sı) one event before. Then if the distribution 
is hn(sı)=fn(usı), it will go over into An+1(S2) =fny1 (use) 
after another event. This will hold for every group 
element u. One easily verifies that the A satisfy (5a) 


P(se1,52) = P(s1,52) if eı contained ‘in E. 
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if the f do. Expressing the above condition analytically, 
one finds that 


fa (0682) = hnya (S2) = f WGP Gaia 


= f In(usı)P(suse)dsı (8) 


must be a consequence of 


PO f fal(s)P(syA)ds. (8a) 


Since ds, indicates the invariant group integration, one 
can replace in (8) usı by s. Writing, furthermore, ¢ for 
usı, (8) goes over into 


fm) = f Sals) P (us umi)ds, (8b) 


and this will be a consequence of (8a) if 
[mortwas= f AOPE suas. (Be) 
Writing out the integration in terms of £,e, 
f EOP 
= f OPUsAget Adds 


where s is the group element with the parameters f,«. 
Since the last equation must hold for all fn, 


Pi f o(t,<)de= P (usut) f e(t,e)de. (8d) 


Both P(s,t) and P(u7's,u-t) could be placed before the 
integral sign because they are independent of e. Setting 
then u= s, one again obtains (la) for P and also obtains, 
from (6a), the same Eq. (2) which holds in the case in 
which there is only one group element which trans- 
forms the standard state into a given state. The only 
difference between the two cases is, therefore, the 
additional condition (Sa) on the distribution functions, 
and the conditions (7a), (7b) on P(s,f), which must be 
satisfied in order to make (2) valid also if there are 
group elements which leave the standard state un- 
changed and hence all the elements of a left coset 
transform the standard state into the same state. It is 
worth noting that (7a), (7b) give 


P(e,5€2) = P(s) for e€), es contained in E. (9) 


Hence, P depends in general on even fewer independent 
variables than f. The example of the following section 
will illustrate this point. 
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CALCULATION OF P(s,t) 


In the case of the motion of a particle in two-dimen- 
sional space without energy change, we obtained the 
expressions (3a), (3b) for P by inspection; insertion of 
these expressions into (2) gave the relatively simple and 
well-known equations for multiple scattering for this 
case and thus verified (3a) and (3b) a posteriori. We 
shall not follow this procedure in the case of the scat- 
tering of a particle in three-dimensional space without 
energy change, principally because the explicit form of 
the equation of multiple scattering in three dimensions 
[i.e., the analog of (3e)] is very complicated. In fact, 
one of the simplifications which the method here pre- 
sented introduces is the avoiding of these explicit 
equations and the possibility of evaluating (2) without 
writing down this equation in any other form. In order 
to use and interpret the solution, it is necessary, how- 
ever, to express P(s,/) in terms of the quantities com- 
monly used (i.e., the differentia] cross section) and to 
establish the connection between the f,(s) and the 
commonly used angular distribution F(Q). These con- 
nections are given, in principle, by (7), (7a), (7b), and 
by (5), (5a) but the corresponding relation will be 
explicitly evaluated now for the aforementioned case, 
i.e., scattering of a particle without change of energy 
under disregard of its position. It will be given also for 
the case in which the position of the particle is con- 
sidered also. 

The group G in the first case is the three-dimensional 
rotation group. As standard state, we choose the one 
in which the velocity is parallel to the Z axis. The sub- 
group E which leaves the standard state invariant 
consists of the rotations about Z. It is reasonable to 
choose the polar angles ¢, J of the velocity direction as 
the variables f describing the state. Equation (4b) 
then becomes 


$(¥,0,€) =S (2,9,0)2 (6), 


where Z(e) is a rotation by e about the Z axis and we 
can choose for s(%,9,0) any rotation which turns the 
Z axis into the g, 8 direction. If we choose 


5(,8,0) = Z(p)X (0), 


(10) 


(10a) 


the angles p, 3, e become the Eulerian parameters of 
the group elements. The distribution function f will 
depend, by (Sa), only on ¢ and # and we have, by (5) 
for the probability of a unit range in ¢ and #: 


F(9,0)= f i (2,8) sindde=2rf(p,) sind, (10b) 


sinddpdöde being the invariant volume element ex- 
pressed in terms of Eulerian parameters. 

According to (9), the function P(s) depends only on 
the parameter d—the simplicity of this condition is a 
consequence of (10a), i.e., of the use of Eulerian param- 
eters to describe the group elements. A similar simpli- 


fication can be accomplished, however, also in most 
other cases by choosing three sets of parameters f,, fo, € 
defined by the equation 


$(F1,62,€) = e($1)8(F2)e(€), (11) 


where e(fı) and e(e) cover the subgroup E and the §(f2) 
are so chosen that all elements of the group are obtained 
by letting {2 vary over a suitable domain. 

Since P(s,/) depends only on s~, it is sufficient to 
determine P (e,t) = P(t) where e is the unit element. We 
then have, from (7), 


f j P(8) sindde=o(9) sind/o. (10c) 


Since we have set sı=e the right side represents the 
probability that an elementary event change the 
velocity, originally parallel to Z, into a velocity at 9, 
e within unit range of dd and de. Since the differential 
cross section o(#) is usually defined per unit solid angle, 
the probability of the transition to unit dédy range 
becomes o() sind. It is an obvious consequence of the 
spherical symmetry of the problem that this prob- 
ability is independent of yg. This does not give an addi- 
tional condition, however, because the symmetry of 
P(s) is fully described already in (9). 
Summarizing, we have 


f(s)=f(¢,8,€) = (2r sind) F (9,8), 
P(s)= P(9,9,€) = (2r)"!e(d)/e. 


(12) 
(12a) 


It may be useful to repeat that F(%,0) is not the prob- 
ability per unit solid angle but per unit dydv interval. 
No sin”!d appears in the expression for f(s) if F is given 
in terms of the distribution function per unit solid angle. 

The expressions for the case in which one wishes to 
consider not only the direction of the velocity but also 
the position of the particle can be derived with equal 
ease. The group G for this case is the group of motions 
in three dimensions (Euclidean group) defined in the 
same fashion as the group of motions in two dimensions 
was. The distribution function F(x,4,2,9,0) depends on 
the position of the particle as well as the direction of its 
velocity. The group elements depend on six parameters: 
the Eulerian angles 9,0,¢ of the rotation, and the three 
components x,y,z of the subsequent displacement. The 
subgroup E contains only the rotations about Z; it is 
the same group as in the preceding example. The con- 
nection between the group theoretic f and the usual F 
is, in complete analogy to (12) 


J (s) =f (x,y > 9,0,€) = (2r sind) F (x,y 2) 9). (13) 


The form of P(s) is simpler if we consider as funda- 
mental process the traversal of a free path and a sub- 
sequent collision (i.e., if we adopt the second point of 
view of the Introduction). The probability of the tran- 
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sition from the standard state into unit interval at the 
state s= (x,y,2,0,0) then becomes 5(x)5(y)e~**a (8) sind. 
Hence, application of (7) to the case sı=e, s2=5 gives 


P(s) f sindde= ô (x) (y) "e (8) sind 


or 


P(s)= P(x,9,2, 9,0,€) = (27) (8) (x) 5(y). (13a) 


One convinces oneself easily that this expression satisfies 
(9). 

The above expressions could have been foreseen 
without the detailed derivation given above. In fact, 
the numerical factors in (12) to (13a) need not be 
known for actual calculations since the successively 
calculated distribution functions can easily be nor- 
malized a posteriori. 


EVALUATION OF THE EXPRESSION (2) 


The reason that (2) can be evaluated particularly 
simply if s and ¢ are elements of a group is that the 
group integral of the product of the convolute and the 
matrix of a representation is the product of two matrices 
which are obtained from the convolvents in a similar 
fashion. In fact, we obtain from D(s)D(u)=D(su): 


ftnoooa= f [more nDwWasd 
= f f In) Plu)D(su)dsdu 


= f fa(s)D(s)ds- f P(u)D(u)du. (14) 


The second line is obtained by substituting ‘= su and 
noting the invariance of the group integral with respect 
to such a substitution.? Since D(s) is, in general, a 
matrix, all expressions in (14) are matrices, with the 
number of dimensions of D. Written out in more detail, 
(13) reads 


f fasr(s)D(s)ardt 
=E f ODO uds fP@D@nds. (140 


This equation holds no matter whether the representa- 
tion D is reducible or irreducible. If D is the regular 
representation, one is led back to (2); the simplest 
results are obtained if D is irreducible. If (14) holds for 
a function fanų: and all irreducible representations, this 
Jm satisfies (2). 


3 Equation (14) must have been known already to G. Frobenius. 
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It follows from the above that 
f fa()D()ds=8 =p]; 
n= f P(s)D(s)ds. (14b) 


Again, the ® and II are matrices with as many dimen- 
sions as D. The evaluation of (14b) is often made very 
much easier by the fact that fo and P satisfy (5a) and 
(9), i.e., actually do not depend on all the group vari- 
ables. In particular, if one assumes the representations 
in the form in which the matrices corresponding to 
elements of E are in the reduced form, only those 
JS fo(s)D(s)xds will be different from zero in which v 
corresponds to a unit representation of E and both p 
and » must correspond to such representations if 
JS P(s)D(s)wds is to be finite. 

In the first case discussed in the last section, it 
follows either from (9) or more directly from (12a), 
that assuming the customary form of the irreducible 
representations? only the 0,0 element of II(N) is dif- 
ferent from zero. One has 


H()n= f P(s)D® (5)yyds 


2r r 2r (8) 
a f f f On Bede sindddde 
0 0 0 


zo 


ot 


= (25/2)8,o5u0 J o(9)doo (9) sinddd. (15) 


Hence, in all powers of the “matrix” II(}) only the 0-0 
element is different from zero and this is the corre- 
sponding power of (15). Similarly, it follows from (5a), 
or more directly from (12), that only the 0 column of 


(1) = fr (s)D (s)ds (15a) 


contains elements which are different from 0. This is, of 
course, consistent with (14b) and (15). 

Let us consider, for instance, the angular distribution 
obtained by means of Born’s first approximation, for 
Rutherford scattering on a shielded nucleus. For a 
potential proportional to r—e—" the angular distribution 
becomes, with B= a?/2k?, where k is the wave number 
of the particle, 


(0) __B2+B) 


o 4n(1-+B-cosd)? 


See, e.g., E. P. Wigner, Gruppentheorie und ihre Anwendung 
(Friedr. Vieweg, Braunschweig, 1931), Chap. XV. 


(16) 
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In order to evaluate the integral (15), it is most con- 
venient to write cos®=x and to use do in the form 


1 d(a—1)! 


doo (3) = P?(3) = a d (16a) 


This gives for II(M)oo (we shall omit the indices 0 for 


convenience) 
1 B2+8) 1 da)! 
Nd)= an f _—_—_ 
~14r(1+8—x)? 24! det 


6(2+8) f (7+ 1) !(1—27)! 
21417 | _1 (1+8— x) 


(16b) 


The second line is obtained by }-fold partial integration. 
For /=0, this gives (as always) IT(0)=1 for }=1, 


II (1) = 1+6—48(2+8) In(2+8)/B. (16c) 


This is the forward bias after one collision; after » col- 
lisions the forward bias in the nth power of this. (16c) 
has been obtained already in the earlier articles of 
reference 1. 
Let us consider, as a second example, the motion of 
a particle without energy change in two-dimensional 
space. The unitary irreducible representations of the 
corresponding group—the Euclidean group of the plane 
—-are infinite-dimensional and can be characterized by 
a continuous variable k: 
DR (7,¥,0) mim= UT nom (kr)e eV, (17) 


The notation is the same as in (3b): the group element 
(r,¥,¢) is given by the matrix of the equation preceding 
(3b) if one sets therein x=r cosy, y=r sind, po= ge. The 
very simple derivation of (16) is given in the Appendix.‘ 
J, is the Bessel function of order n. We define again 


T1() mm = f P(s)D® (5) mimi 


= J o(p) eai (Y— WE YIm_m(kr) 
Xei™ewWrdrdydp; (17a) 


the P(s) was taken from (3b). The integration over y 
can be carried out at once, and one obtains 


TI (2) mm = Í Sa f : eT m—m (Rr) dr 


=— gt (17b) 
(R?+0°)} 
where 
Ir Cc oc i 
| o(p)eirrdo; (142). (17c) 


‘For representations of similar groups, see also E. Inonu and 
E. P. Wigner, Nuovo cimento 9, 705 (1952). 


One sees that II(k) is again of rank 1. One easily proves 
by induction that 
[2ro (0) ]* 
CO) p+! =I (k), (17d) 
[ot ey 
where ø (0) is the differential cross section in the forward 
direction: 


21a (0)=>> om. (17e) 


It would have been possible, of course, to transform 
(17) in such a way that, similar to (15), only one ele- 
ment of the II(k) matrix would have been different from 
zero. However, the rather arbitrarily chosen form of 
(17) has caused no difficulty. 

In order to obtain the expansion matrices $("’(k) of 
fn from those of [II(k)]*, we have to multiply the 
matrix expanding fy with [II(z) ]*. If the particle is, 
originally, in the standard state, its expansion matrices 
are all unit matrices and #)(k)=[II(k)]*. Hence, 
these matrices play a role similar to that of the source 
solution of diffusion equations. In order to return, from 
the &, to the distribution function f, one has to invert 
equations of the form (15a). This is a trivia] matter if 
the group is closed and can be done easily also for the 
representations (17) by means of well-known expansion 
formulas for Bessel and trigonometric functions. The 
problem of inverting Eqs. (15a) for not closed groups 
has been attacked recently in a rather general form by 
Harish-Chandra and by Segal.’ In many cases, such as 
dealt with in Eqs. (16), the expansion matrices have 
more immediate physical significance than the dis- 
tribution function f itself. 


CALCULATION OF TOTAL DENSITY FUNCTIONS 


In many cases one is interested in the total distribu- 
tion of all particles, irrespective of the number of ele- 
mentary events they have passed through. Usually, this 
will be finite only if the elementary process can lead to 
absorption. | 

If the probability of absorption 1—y is independent 
of the state of the particle before the elementary event, 
the total distribution function will be 


= rn. (18) 
The corresponding momentum matrix becomes 
P= f HD Od=T vO) 
=O (HI yHD*=6()(1-vN@)). (18a) 


The right side of (18a) can be written down at once in 
all cases considered in the preceding sections. 


5]. E. Segal, Ann. Math. 52, 272 (1950) and Harish-Chandra, 
Proc. Natl. Acad. Sci. U.S. 37, 813 (1951). 
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APPLICABILITY OF THE METHOD 


It has been realized already by Grosjean that one can 
successively evaluate the distribution functions for 
multiple scattering, if one wishes to take the energy 
changes into account, most easily if the probability 
of a certain fractional change in energy is only a function 
of the scattering angle. This is the case for elastic 
scattering on atoms at rest. It seems natural to ask, 
therefore, whether the calculations presented above, if 
applied to a larger than the Euclidean group, could be 
given a physical interpretation. We shall not try to 
solve this problem in general but point only to the 
simplest generalization of the Euclidean group, that 
formed by the matrices 


cr a 


0 1 


; (19) 


In (19), r is a real orthogonal matrix, a is a real vector, 
c>0. For c=1, the s form the Euclidean group ; dropping 
this restriction increases the number of group param- 
eters by 1. Clearly, c will be connected in some way 
with the velocity of the particle. 

The condition of invariance means that the transition 
probability from s, to sə be equal, for all u, to the 
transition probability from us, to sz. It is enough to 
demand this for a set of # from which all group elements 
can be obtained by multiplication. We choose the fol- 


lowing u: 
r 0 
be 


0 1 


cl 
0 


Um » t= 


s (19 
| 2 


Ia 
0 1 


The above condition, applied to «=, shows that the 
transition probability from the state (¢),71,a,) to 
(62,72,9:) is equal to the transition probability from 
(cy, 71, aıta) to (C2, r2, a»+a). This shows that a plays 
the role of the position vector of the particle, or is 
proportional to the position vector. The invariance 
condition is also satisfied with respect to uz if r con- 
tinues to describe the direction of the velocity of the 
particle. Application to us gives, finally, that the 
transition probabilities 


(¢1,71,81)—> (C2,r2,a2) and (ccy,71,c81)—>(cc2,r2,ca2) (19b) 


are equal for all c. In other words, the probability of a 
path of length |c(aı—a,)| at the value cc, of the first 
parameter is as great as the probability of the path 
ja—az| at the value cı of the first parameter. Ex- 
pressed still differently, the mean free path is propor- 
tional to the first parameter. For the first parameter 
itself, the change from cı to cz is as probable as from 
cc, to cca. The simplest interpretation of this is that 


c=Cov*; (19c) 
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that is, c, and hence also the mean free path, is pro- 
portional to the »’th power of the velocity where +0 
but can be arbitrary otherwise. Together with the pre- 
ceding conditions, condition (19b) then stipulates that 
the angular distribution of the scattering be independent 
of the velocity and that the probability of a certain 
fractional change in energy depend only on the change 
in the direction of the velocity. 

It seems likely that many other multiple scattering 
distributions can be calculated accurately, many others 
approximately, following the procedure outlined above. 
So far, no case has been encountered in which the cal- 
culation of the nth power of II, or of the reciprocal in 
(18a), would have been at all difficult. On the other 
hand, it does not appear to be possible to use the 
rotation (or Euclidean) group to calculate the multiple 
scattering of a particle with spin, by using the variable 
e of (12) to describe the spin’s motion. It seems that a 
more powerful method is needed to overcome the dif- 
ficulties of this problem. 


APPENDIX 


The irreducible representations of the group of mo- 
tions in two dimensions can most easily be given in the 
Hilbert space of functions f(a) the variable a of which 
is restricted to the interval O<a<2r. The operators 
P,=P,,y,, of the representation in this space are 
defined by 


P.f(a) = Pıy, ef (a) = prik(r cosy cosatr siny sina) f (q— p) 
= ett ne f(a p). (A1) 


The group element s is a rotation by ¢ about the origin, 
followed by a displacement with a vector the polar 
coordinates of which are r and y. Borrowing concepts 
from quantum mechanics, one can say that f describes 
states in which the absolute value of the momentum 
is k but the direction of the momentum is variable and 
given by a. Rotation of this state by ¢ replaces a by 
a— g; displacement by the vector r cosy, r siny multi- 
plies it with the exponential in (A1). The magnitude of 
k characterizes the representation; it can assume any 
positive value. 
One can write (A1) also in the form 


Pf = [am (rre)eaf@)da’, (AD 
with a singular representation matrix: 
A® (7, p)ar =e Noa gpa), (A2a) 


and the calculation of the text can be carried out also 
using this form of A“. The form of D* given in the 
text is obtained by using, instead of f(a), its Fourier 
expansion : 


f(@)= Im fne. (A3) 
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Substitution of this into (A1) gives Because of f 
Pe. ES aa a a Tr Jn(2)= (Zr) i" f e~ cosh- inad,  (ASa) 

m m 0 


= E DE (r,,0)mmfme’, (A4) this differs from the expression given in the text only 
mm’ by the factor i" which can be eliminated by a 
where similarity transformation. It is worth noting that the 
27 transition from (17b) to (17c), i.e., the evaluation of 
DE (yo) m'm= (27)! f e~ thr cosa) eim(e—a)eim'ady the second integral of (17b), can be best carried out 
0 using the form (ASa) for J. 
= A similar calculation is possible also in the three- 
= (2r) f ehr cosp ilm =m PAR dimensional case but it is more laborious and will not 
o be given here. It is, essentially, contained in the Ap- 
Xetm—-mytime, (A5) pendix to Grosjean’s last article. 


Derivations of Onsager’s Reciprocal Relations 


E.P. Wigner 


Journal of Chemical Physics 22, 1912-1915 (1954) 


(Received May 24, 1954) 


Two derivations of Onsager’s reciprocal relations are presented in simplified form, and the range of their” 


validity is discussed. 


1, GENERAL STATEMENTS 


HERE appear to be two groups of derivations of 
Onsager’s reciprocal relations." In the first of 
these, it is assumed that the macroscopic laws of motion 
hold for the averages of the macroscopic coordinates 
(such as temperature gradient, concentration gradient, 
etc.) even if their values are microscopic, i.e., in the 
usual range of fluctuations.? The second group assumes 
a definite statistical law for the path representing the 
system in phase space.? Both types of derivations will 
be reviewed briefly below. 

Common to both derivations is the concept of macro- 
scopic coordinates Q:,-:+,Q, which are functions of 
the microscopic coordinates gı, ***, gv, Pi, *°*, Px. The 
latter are the coordinates of the phase space I’; the space 
of the former will be called y. A typical set of macro- 
scopic coordinates are the components of the tempera- 
ture gradient. This can be defined as 


(2/3k)>ai(p2/2m:— 3kT/2), 


and similar definitions apply to other macroscopic 
variables. 


1L. Onsager, Phys. Rev. 37, 405 (1931); 38, 2265 (1931). 
These articles contain also extensive references to earlier literature. 
See also I. Prigogine, Elude Thermodynamique des Phenomenes 
Irreversibles (Editions Desoer, Liege, 1947) and S. de Groot, 
Thermodynamics of Irreversible Processes (North Holland Publish- 
ing Company, Amsterdam, 1951). For a review of the recent work 
on the subject see E. M. Montroll and M. S. Green, “Statistical 
Mechanics of Transport and Nonequilibrium Processes” (to be 
published in Annual Reviews of Physical Chemistry). 

2 For the first type of derivation, see in addition to Onsager’s 
articles, particularly R. T. Cox, Revs. Modern Phys. 22, 238 
(1950); 24, 312 (1952). This article uses quantum concepts, in 
particular the notion of the transition probability. It is assumed 
that the system changes its state by means of spontaneous quan- 
tum jumps. This .assumption eliminates the induction period 
discussed in the text [after Eq. (14a) ]] and entails the validity of 
differential equations of the form (4a) for the macroscopic vari- 
ables. It is, therefore, not entirely clear that the reasoning con- 
tained in these articles would remain valid in all details if quantum 
mechanical concepts were used. In quantum mechanics, the prob- 
ability of a state decreases, initially, as 1—c# rather than 1—ct 
and the latter, linear, law takes over only after an induction period. 
Ideas similar to those of Cox have been put forward also by H. B. 
Callen and R. Greene, [Phys. Rev. 86, 702 (1952); 99, 1387 
(1952)], based on work by H. B. Callen and T. A. Welton [ Phys. 
Rev. 83, 34 (1951) Jand by J. Jackson [Phys. Rev. 87, 471 (1952). 
These papers also assume the “law of transition probabilities.” 

3 The basis of the second type of consideration seems to have 
been provided by J. G. Kirkwood [J. Chem. Phys. 14, 180 (1946)]. 
For the actual derivation see M. S. Green [Princeton dissertation 
ee and J. Chem. Phys. 20, 1281 (1952)] and N. Hashitzume 

Progr. Theoret. Phys. 8, 461 (1952)]. It was developed with 
ee elegance by L. Onsager and S. Machlup [Phys. Rev. 
1, 1505, 1512 (1953)]. 


There corresponds to every domain in y a domain in 
phase space but the ratio of the volume of the latter 
domain to that of the former is not constant. The in- 
tegral of ¢-7/*? over the domain in T space gives the 
unnormalized probability of the domain in y space to 
which it corresponds. This probability has a sharp 
maximum at the “equilibrium values” of the Q which 
will be assumed to be 0. Second, it will be assumed that, 
for small but macroscopic values of the Q, the entropy 
of the system can be expanded into a quadratic function 
of the Q: 


S(Q)—S(O) = — kE s" Q= —2(Q,5Q), (1) 


in which s= (s**) is a symmetric positive definite matrix 
and Q stands for Qı, ++, Qn. The normalized prob- 
ability of a unit volume element at Q is then given by 


P(Q)=Z-' exp(Q, — sQ). (1a) 


This is, by the connection between entropy and exten- 
sion in T space adopted here, the integral of e”?!*T over 
that part of T space which corresponds to the unit 
volume element at Q iny space, divided by the integral 
Z= fe*kTdpdg over the whole T space. Z can be 
defined also by the condition that 


f P(QdQ=1; Z=r*?|det s|}. (ib) 
One can easily verify, furthermore, that 
2 f 00.P(Q)40=e ao 
forms the reciprocal matrix to s 
resi (1d) 


Only ø will occur in subsequent work so that (1c) can 
be taken as the definition of o. The relation to the 
entropy is given, however, by (1) and (1d). 

It will be assumed, finally, that the Q contain the 
time variable only in even order, i.e., that they are 
invariant with respect to time inversion and that the 
same holds also for the whole system, i.e., its Hamil- 
tonian. Let us denote then by To the domain in T space 
which corresponds to a unit volume element at Q in y. 
Similarly, To, corresponds to a unit volume element 
at Q’. In proper units, the volumes of Tg and Tg: are 
equal to P(Q) and P(Q’). As time flows, the systems 
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represented by the points in Tọ will change. Let us de- 
note by To... the part of the domain of To the points 
of which move, in the time interval i, into the domain 
To. Then 


T,(0—0'’) = volume of To-.o-/volume of To: 
= volume of To-a’/P(O) 


is the transition probability from Q to Q’. Since every 
point in Tọ will arrive, after time ¢, in some T'o’, 


(2) 


f T,(Q—»0")dQ'=1. (2a) 


Let us consider now the domain '9.g* which can be 
obtained from To-.o’ by replacing all points qi, ***, QN, 
pi, tt, PN of Teg: by Qi, °'', QN, Pi, ***, — PN, OF, 
more generally, by applying the operation of time in- 
version to 9.9. Clearly, the volume of Too is equal 
to the volume of roq" 


volume of l'9.9°=volume of Tgog". (3) 


Furthermore, because of the invariance of the Q with 
respect to time inversion, '9g.9* is also contained in Tọ. 
We note, finally, that the points which are at time O in 
Ta-g'* were, at time —/, in Tg just as the points which 
are at time 0 in To-.g- will be, at the time /, in rọ. In 
fact, the points which are, at time 0, in 'g.g-* formed, 
at time —i, the domain To-.g. Hence, because of 
Liouville’s theorem, 


(3a) 


It follows (see Fig. 1) that the volumes of Too and of 
Too are equal, or because of (2) 


P(Q)T.(Q0'1) = P(Q)TA(Q’—Q), (3b) 


i.e., that the principle of detailed balance holds in y space. 
This conclusion is based solely on the postulate of the 
invariance of the Q and of H with respect to time in- 
version and, of course, on Liouville’s theorem and on 
the independence of H from time. 


volume of To'o = volume of Too”. 


2, ASSUMPTION OF THE MICROSCOPIC VALIDITY, 
ON THE AVERAGE, OF THE MACROSCOPIC 
EQUATIONS 


It is in conformity with the idea of a relaxation phe- 
nomenon to demand, at least for macroscopic values of 
the Q, that the average values of the Q, at time £, be 
linear functions of the initial. values 


f T(Q—0)0,'40’=Ee() "Or. (4) 


In fact, from a purely macroscopic point of view, one 
would expect (4) to hold even for infinitely short times, 
i.e., one would expect equations of the form 


(d/di)Q.() = ~E lQ: (4a) 


to be valid. This would not be consistent, however, 
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with the microscopic point of view; in fact, because of 
the invariance of the Ọ with respect to time inversion, 
the average of each dO/di over Ig vanishes. However, 
the integrated form (4) of (4a) can be expected to be 
valid after a certain induction period. This induction 
period ż is so long that dQ/dt already undergoes sub- 
stantial changes during this period, i.e., |d?Q/dé?|z is, 
for most points of Tg, much larger than |dQ/dt|. We 
shall use, below, only the integrated form (4). 
However, the integrated form will be assumed to be 
valid for all, even microscopically small Q. The re- 
ciprocal relations then follow at once: multiplication of 
(4) with P(Q)O: and integration yields, because of (1c), 


f J P(Q)T(Q-0')0:0,'dQdQ’= 45 (Four. (5) 


One sees by means of (3b) that the left side is sym- 
metric in / and i, . 


f J P(Q)T(Q0')0.0,'dQdQ’ 
-f J P(Q)T.(O'Q)0,0,'d0dQ' 
-Í J P(Q)T(Q-0')0.0;'20'aQ. 


The last part follows by interchanging the variables Q 
and ©’. It follows that the matrix eo is symmetric 


(5a) 


The last form shows that (5a) holds also if any function 
of e is substituted for e; (5a) embodies the reciprocal 
relations. Because of o=s”!, one can write it also in 
the form 


e=(eo)=od or Tleo=d. 


(Sb) 


se=e's=(se)'. 
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It was first noted by Casimir‘ that the validity of the 
assumption, that (4) holds even for very small Q, is very 
difficult to assess. In fact, the significance of (4) for 
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cally, it is assumed that 
TAQ’) 
=C exp— {(Q,aQ)+2(Q’,6Q)+ (Q’,7Q’)}. (6) 


such values of Q as are assumed in the course of ordi- 
nary fluctuations, is quite different from the macro- 
scopic case. In the latter case, the T,(Q>Q’), as 
function of Q’, has a rather sharp maximum, the width 
of which is much smaller than its distance from the 
point Q’=0. If the Q are microscopic to begin with, 
most of the systems represented by Tọ will have gone 
through all the planes Q,’/=0 several times during the 
time interval t; after the time / the system will have only 
a rather faint memory of its initial state Q. It is not 
at all clear that this memory should be subject to the 
same macroscopic laws which govern the values of the 
Q in the macroscopic case. In the macroscopic case the 
value of the macroscopic variables at time ż is given by 
the right side of (4) for practically all systems repre- 
sented by the points of Tg. 

The above point can be illustrated on the example of 
the stopping of a macroscopic particle by the viscosity 
of the medium in which it is moving. If the velocity of 
the particle is many times in excess of the equilibrium 
average value, a Stokes’ flow will establish itself around 
it. The energy loss will be determined by the properties 
of this flow. On the other hand, if the energy is of the 
order kT, it is very likely that the atoms around it will 
not assume any ordered motion. One must recall that, 
in thermal equilibrium, there are no correlations be- 
tween the velocities of any particles: e~*/*7 is, as far 
as the velocities are concerned, a product of functions, 
each depending on one velocity only. Hence, on the 
average, there is no Stokes’ flow; an ordered motion of 
the gas is contingent on the macroscopic body having 
moved, some time before the observation, with an 
even higher than its velocity at the time of observation. 

The reader interested in mathematical implications 
may note that, according to (3b), 


K(Q,0’) = P(Q)'T(QQ’) P(Q (Sc) 


is the kernel of a symmetric operator in the real Hilbert 
space of square integrable functions of the Qu, ++, Qn. 
The normalization property (2a) (which was not used 
in the above derivation) then expresses the fact that 
P(Q)? is a characteristic function of K with character- 
istic value 1. The macroscopic equations (4) show that 
the subspace of the functions Q;P(Q)! is invariant 
under K. 


3. MOTION OF THE SYSTEM CAN BE REPRESENTED 
AS A BROWNIAN MOTION IN THE SPACE OF 
THE MACROSCOPIC VARIABLES 


A rather different and very suggestive derivation of 
the reciprocal relations? is based on the explicit assump- 
tion that the point representing the state of the system 
in y space performs a Brownian motion. More specifi- 


1H. B. G. Casimir, Revs. Modern Phys. 17, 343 (1945). 


The matrices a and y are symmetric. They;as well as £,. 
depend on /. The normalization condition (2a) can be 
satished only if a=8’y~'8. Hence, writing e= —+7~'8 
one obtains : 


T(Q—Q’)=C exp— (Q'— eQ, y(Q’—«Q)). (6a) 


This shows that y must be positive definite. One sees. 
that the most probable value of the Q’ after the time /, 
as well as their average value, is given by Q’=«Q. 
Hence, e plays the same role in the present derivation 
as it did in the preceding section. 

Clearly, (6a) constitutes a particularily simple as- 
sumption for T, which entails, furthermore, the validity 
of (4). Hence, the reciprocal relations will surely follow 
from (6a) and (3b), if (6a) is assumed to be valid for all 
Q. However, and this is the point of the present con- 
sideration (in addition to the suggestive nature of the 
picture which it presents), it will be necessary to assume 
(6a) only for macroscopic Q. Instead, the form (1a) 
will be explicitly assumed for P(Q). One may recall, 
however, that (1a), as well as (3b), followed from the 
general principles of statistical mechanics. 

Inserting therefore (1a) and (6a) into (3b), one 
obtains 


exp— {(Q,sQ)+ (Q’— €Q,7(Q’— «Q))} 
=exp— {(Q’,sQ’)+ (Q—«Q’,7(Q— «Q’))}. (6b) 


One has to assume the validity of this equation only 
for a few different values Q, Q’, to conclude that 


um 


e'y= YE. (7) 
Inserting the second equation into the first one gives 
stye=yor 


steye=y 


y=s(1— ê), (7a) 
and substitution of this into the second one 
es(1— i= s(1— e) “le. (7b) 


Since 1— e and e commute, this is equivalent to (5b). 
i.e., the reciprocal relations. 

It might appear from the above that the assumption 
(6) is used only in a very mild form. This is indeed true 
as far as the necessity is concerned to use (6) for Q 
which are close to equilibrium: It is not necessary to 
use (6) for such Q. However, at least one of the pairs 
Q—Q’ or Q'—Q in (7) is such that the corresponding 
transition probability is extremely small. The assump- 
tion that (6) can be used in such a case is a rather 
stringent one. 

Again, the reader may be interested to note the con- 
sequences of the Markoffian nature of the process, i.e., 
the consequences of the condition that Fee (Q—Q”) is 
the integral, over dQ’, of T(Q—Q^) Te (Q'—Q”). This 
condition gives two equations for the matrices e(f) and 
y(t) which occur in (6a). The first of these is obtained 
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most easily by means of (4) which is valid also in this 
case 


e(i+)=e(!’)e(t). (8) 
It follows from this that 
ed=er", (8a) 


where J is a positive definite matrix. The condition for y 
reads - 


ee elt y Oe e). (9) 
This is satisfied by the y given by (7a): 
y) = (1-e "4a, (9a) 


if the reciprocal relations (5a) are assumed to hold. 
Both / and ø are independent of ¢. However, (9a) is not 
a consequence of (9) but follows from it only if (3b), 
(1a) and (1d) are also assumed to be valid. Actually, as 
was explained after (4a), none of the equations (8) or 
(9) holds for very small /. | i | 
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It might appear from the above discussion that both 
derivations of Onsager’s relations assume additional 
conditions which are hard to justify rigorously. Ac- 
tually, as was pointed out by Casimir‘ (and in a par- 
ticular case already by Onsager)!, there are cases in 
which the deviation from the equilibrium decays ex- 
ponentially even after it has become so small that it is 
in the normal range of fluctuations. According to 
Casimir, this will always be the case for processes which 
can be described by means of Boltzmann’s distribution 
function f(2,9,2,2,2,2:; À in the space of the variables 
of single atoms and if the deviation of f from the equi- 
librium f is sufficiently small at every point of this space 
to justify the usual linear approximation. In such a case, 
the first derivation! is fully justified. No similar argu- 


ment-can be made on the basis of ¢onsiderations in T 


space, because the deviation from the equilibrium dis- 
tribution in T space is large for all nonequilibrium 
systems. 


Lower Limit for the Energy Derivative 
of the Scattering Phase Shift 


E.P. Wigner 


Physical Review 98, 145-147 (1955) 


(Received December 10, 1954) 


It is shown that the derivative of the scattering phase shift with respect to energy, dn/dE, must exceed a 
certain limit if the interaction of scattered particle and scatterer vanishes beyond a certain distance. This 
limitation of dn/dE is, fundamentally, a consequence of the principle of causality; it is derived, however, 


from a property of the derivative matrix R. 


I 


TE cross section and its angular dependence, as 
functions of energy, do not seem to determine in 
general the phase shifts uniquely.! It may be useful, 
therefore, to derive certain general rules about the 
energy dependence of phase shifts which may facilitate 
the choice between the apparently equivalent sets of 
phase shifts. The relations to be derived here are based, 
fundamentally, on what has come to be called “the 
principle of causality.” It states that the scattered wave 
cannot leave the scatterer before the incident wave 
has reached it. However, the calculation to be carried 
out will make use of a single property of the derivative 
matrix R which was given already by Eisenbud and 
the present writer.” 

Before carrying out the very simple calculation, the 
general nature of the result will be illustrated by means 
of Eisenbud’s interpretation of the energy derivative 


of the phase shift as time delay.? Let us consider, for 


sake of simplicity, a scattering center of radius a, i.e., 
assume that the incident particle behaves like a free 
particle outside a sphere of this radius. Let us consider 
then an incident beam which is the superposition of two 
monoenergetic beams of energy h(v+v’) and h(v—v’), 
respectively. The corresponding wave numbers will be 
denoted by k+k and k—k’. Hence, 


Ving = rT (EHR iby DEL gil tr), (1) 


Both k’ and v’ are infinitesimally small so that (1) is a 
substitute for a wave packet,‘ the center of which is at 
the point where the two spherical waves of (1) are in 
phase, i.e., where 


2k'r+-2v't=0. (1a) 


Since v'/k’=dv/dk is the velocity of the particle, the 
incident particle indeed moves with a velocity v toward 


1 See, e.g., the articles of Fermi, Metropolis, and Alei, Phys. 
Rev. 95, 1581 (1954); de Hoffmann, Metropolis, Alei, and Bethe, 
Phys. Rev. 95, 1586 (1954); and of R. L. Martin, Phys. Rev. 95, 
1606 (1954) for the ambiguities in the case of pion-nucleon 
scattering. 

2E. P. Wigner and L. Eisenbud, Phys. Rev. 72, 29 (1947), 
see (i) on page 35. 

®L. Eisenbud, dissertation, Princeton, June 1948 (unpublished). 

t Instead of the superposition of only two monoenergetic waves, 
one can use a regular wave packet in this consideration; r in (1a) 
and (ic) then becomes the coordinate of the center of mass of 
that wave packet. 


the scattering center. If 9+’ and n—n' are the phase 
shifts which correspond to the energy values h(v+v') 
and A(v—v’), the outgoing wave will be 


ur (ef +k ri Hi) 
+ei®-r-it eier), (1b) 
The two waves of (1b) are in phase where 
2k'r—2v't-+-4n' =0, 
i.e., where 
r= —2n!/k! + (v'/k')t= —2dn/dk+ (dv/dk)t. (1c) 


One sees that the outgoing wave is retarded by a stretch 
2dn/dk; it arrives at a point r—2dn/dk at the time it 
would have arrived at r without the action of the scat- 
tering center. The causality principle as formulated in 
the first paragraph gives no upper value for the re- 
tardation 2dn/dk: if the particle is temporarily captured 
by the scattering center, there is no reason for it not 
being retained an arbitrarily long time. However, the 
“retardation” cannot assume arbitrarily large negative 
values, in classical theory it could not be less than — 2a. 
It will be seen that the wave nature of the particles does 
permit some infringement of the relation 


dn/dk>—a. (2) 


It will be shown that, nevertheless, (2) is essentially 
preserved also in quantum theory. It does hold, in par- 
ticular, for large k. 

The relation (2) gives a simple physical interpretation 
to the qualitative behavior of the energy dependence 
of n. Close to resonances, where the incident particle 
is in fact captured and retained for some time by the 
scattering center, dn/dk will assume large positive 
values. On the other hand, dn/dk will be close to —a 
at energy values at which the incident particle hardly 
enters the scatterer. One would expect (on the basis of 
Liouville’s theorem or the completeness relations) that 
the two effects, on the whole, balance each other, i.e., 
that the integral of dn/dE over the whole energy range 
is close to zero, at least if the scattering can be de- 
scribed by a nonsingular potential. This is indeed the 
case: 7=0 at E= œ and n=br for E=0, where b is the 
number of bound states. Hence, if the cross section 


5 I am much indebted to Dr. V. Bargmann for this observation. 
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shows a resonance behavior, one will expect n to de- 
crease slowly between resonances and increase fast at 
resonances, increases and decreases almost exactly 
balancing if considered over the whole energy spectrum. 


u 


In order to derive a rigorous minimum value for 
dn/dk, it will be necessary to assume that there is a 
radius a outside of which the wave function is that of a 
free particle. This is a somewhat artificial assumption 
which will be hardly ever satisfied rigorously. One can, 
therefore, expect that the condition (4b) to be derived 
under this assumption will be violated occasionally. 
However, if a is chosen reasonably large the violations 
will be uncommon and will extend over narrow energy 
intervals. 

We define as internal region the inside of a sphere of 
radius a and call the rest of space the external region. 
The wave function y in the external region is then given 
by 


rp=I(r)—eI*(r), (3) 


where I(r) is the radial part of an incoming spherical 
wave (of arbitrary angular momentum), its conjugate 
complex /*(r) is the outgoing wave; 7 is called the 
phase shift. For n=0, Y must be regular at r=0, which 
fixes the phase of J except for J’s sign. We shall nor- 
malize J so that 


IN! (r)*—I(r)*I’ (1) = 23. (3a) 


I(r) then represents a wave with flux m/h. The prime 
denotes the derivative with respect to r. Both J and ņ 
also depend on the energy; we shall use, instead, the 
wave vector & as variable and denote differentiation 
with respect to k by a dot. 

The value of the reciprocal logarithmic derivative R 
of y, with respect to r, at r=a, becomes 


I—einy* 
R= 


—, 3b 
I!’-eHI’* > 
The J, J’, etc., without argument, denote the value of 
the corresponding expression at r=a. Calculation of 
ei" from (3a) yields 

eins (T—J'R)(I*—I'*R)7. (3c) 
It may be remarked, parenthetically, that this expres- 
sion remains valid if, in addition to scattering, trans- 
mutations are also possible. The left side has to be 
replaced then by the collision matrix, R by the deriva- 
tive matrix, and /, T’, etc., by diagonal matrices, the 
diagonal elements of which are the /(a), T’ (a), etc., for 
the channel to which the row of the collision matrix 
refers. 


6 It is necessary, in collision theory, to fix also the sign of I(r) 
in terms of the decomposition of the plane wave into spherical 
waves. This is not necessary, however, for the derivation of (4b). 
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In the present case, all quantities in (3c) are numbers 
and logarithmic differentiation of (3c) with respect to 
k gives 


ooo aooo] = 
Aal I-TR m—I*R J 
The term proportional to È in (4) is 
—T'(I*—I*R)+I*(I—I'R) . R 
Be ee BS 
2i|T—I'R|? I-TR| 


The last part follows from (3a). The partial fraction 
expansion? of R shows directly that R= (R’k/m)dR/dE 
>0 for k>O so that omission of the term containing R 
will make the left side of (4) larger than the right side. 
Elimination of R from the resulting inequality by 
means of (3b) yields 


dn/dk=n>% Re[l’I*-1I*+ (fl —I' De"). (4b) 


Re[---_] denotes the real part of the expression con- 
tained in the bracket. The computation leading from 
(4) to (4b) involves the use of (3a) and the equation 
obtained from it by differentiation with respect to k.® 
It remains to point out that the partial fraction 
expansion of R, though obtained in reference 2 by 
direct calculation, has been shown’ to follow also from 
the causality condition described at the beginning of 
this note. This establishes the relation between (4b) 
and the qualitative consideration based on Eisenbud’s 
early work. The connection becomes even clearer if 
one inserts, into (4b), the expressions for J for J=0 
or/=1: 
To(r)=k-te-**rs An)=kAllk)"+i)e". (5) 


These give, in the former case, 


o> —a+ (2k)! sin2 (no+ Ra) ; (5a) 
while one has, in the latter case, 
m> —a+ (k'a) (1 — cos2 (m+ ka)] 
— (2k) sin2(ņı+ka). (Sb) 


It will be noted that, as long as ka<<1, these equa- 
tions actually entail 7>0 for a reasonably large interval 
of n. However, at very low k, (5b) merely expresses the 
fact that 7 and 7 are proportional to k and &°, respec- 
tively, without limiting the proportionality constant. 

Naturally, the proper limitation of a constitutes the 
principal difficulty in using (4b) to select the actual 


7 In addition to Eisenbud’s doctoral dissertation, see W. Schutzer 
and J. Tiomno, Phys. Rev. 83, 249 (1951) and, in particular 
N. G. Van Kampen, Phys. Rev. 89, 1072 (1953); 91, 1267 (1953). 
Also J. S. Toll, doctoral dissertation, Princeton University, 1952 
(unpublished), and the more recent article of Gell-Mann, Gold- 
berger, and Thirring, Phys. Rev. 95, 1612 (1954). This article also 
contains references to the early literature. 


Lower Limit for the Energy Derivative of the Scattering Phase Shift 


SCATTERING PHASE SHIFT 


phase shifts among the sets which reproduce the cross 
section. An estimate of a can be obtained only from an 
at least qualitative theory of the nature of the inter- 
action. It is commonly believed, for instance, that the 
pion-nucleon scattering proceeds via absorption and 
re-emission of the pion by the nucleon. If this is the 
case, @ will be of the order of the pion Compton wave- 
length, i.e., 4/uc. However, it would be quite difficult 
to tell to what extent a=2%/uc is a permissible choice 
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or whether it is necessary to assume a=3%/uc or an 
even larger a, giving less and less stringent forms to 
(4b). An alternative form of applying the relations of 
this paper, which might be somewhat more free of this 
ambiguity, would be to plot R, as calculated from (3b), 
against the energy and to judge whether any possible 
deviation of the R obtained this way from a regular R 
function can be blamed on having assumed a too low 
value for a. 
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Our appreciation of the significance of collision theory has increas- 
ed immensely in the post-war years. This appreciation has not yet 
had an adequate effect on the textbooks for courses in quantum 
mechanics and the evident need for learning the subject has created 
a veritable flood of treatises and reviews. As a result, the same basic 
ideas and applications have been described many times and there 
is a certain repetitiousness in the literature. This will apply also 
to the present review, the purpose of which is, as I understand my 
assignment, to serve as a background for the deliberations of this 
conference. Collision theory is one of the pillars which support 
the structure of transport theory. However, one may as well admit 
at the outset that, from the point of view of first principles, this 
pillar is not fully integrated into the structure. Transport theory 
should be a many body theory both in classical and in quantum 
physics, i.e., one should at least start from the many body Hamil- 
tonian and many body wave functions. The reduction of the many 
body problem to the problems we commonly consider 1s plausible, 
in fact practically surely correct in the case of a dilute gas. However, 
it has not been carried out rigorously to date “) and it is surely 
impossible to carry it out rigorously except in the limiting case of 
a dilute gas. 


The problem which I just touched upon will not be part of this 
review, which will deal, rather, solely with collision theory. It will 
consist of three parts : a shorter first part dealing with the funda- 
mental aspects of collision theory, and a somewhat longer third 
part dealing with its methods and applications. Sandwiched between 
these two parts will be some remarks on R-matrix theory. 
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I. FUNDAMENTAL CONCEPTS 


The conceptual significance of the collision matrix, originally 
introduced by Wheeler 2), was recognized by Heisenberg 3). In 
order to define the collision matrix, it is not necessary to give the wave 
function for those parts of the configuration space in which the 
colliding particles are in intimate contact. Rather, it suffices to 
specify the state of the system long before, and long after, the collision. 
Equivalently, one can specify the wave function asymptotically, i.e., 
for those parts of the configuration space in which the colliding 
particles, and the collision products, are well separated. In this 
region, the wave function for a stationary state will have two parts : 
that describing the incident beam and that the separating collision 
products. Let us denote the incident beam which contains particles 
of the type c by tic. The letter c describes the nature of the colliding 
particles (whether H and Cl», or H and an electron) and their state 
of excitation. The wave function in the aforementioned part of the 
configuration space then has the form 


Vie == 2 See! Voc’ (1) 


where poe is an outgoing wave, describing separating particles, the 
nature and state of excitation of which is characterized by the index c’. 
The quantities Sec’ form the collision matrix. It has as many rows 
and columns as there are states c’ which can be produced by the 
collision of the particles characterized by c. It is a function of the 
energy E of the system and also the directions into which the particles 
which constitute the pairs c and c’ travel. As far as the end result 
is concerned, the collision matrix describes the collision completely, 
and it describes it not only for the collision in which the particles 
c collide but also for collisions in which any of the possible collision 
products of this collision collide. The row index c can assume the 
same values which the column index c’ can assume. 


Heisenberg’s early publications do not provide a method for 
calculating the collision matrix without considering the whole pro- 
cess of the collision, i.e., the state vector at times at which the collid- 
ing systems are in intimate contact. As a matter of fact, the develop- 
ment immediately following Heisenberg’s suggestion led first to 
Dyson’s formula “ for the S matrix 


S = Z(iR)"0 J... JW) ... W(ta)dt, ... dtn/n'. (1a) 
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Here V(r) is the operator of the interaction, in the interaction (or 
Dirac) representation, between the colliding bodies; 6 signifies that 
the time ordered product of the ensuing expression has to be taken. 
The integrations have to be extended over all time. In field theory, 
V(t) is an integral of the interaction operator over all space so that 
the term on the right side of (la) contains, effectively, a space-time 
integral over n quartets of variables. Hence, (la) can be written in 
such a way that its relativistic invariance is at once evident. Clearly, 
(1) is but another form of the Born series ‘), i.e., the expansion of the 
scattering and collision amplitudes (which are essentially the matrix 
elements of S) into a power series with respect to the interaction 
operator. There is only one such power series and, even if they 
have different forms, two power series for the same amplitude must 
be equal term by term. 


The principal significance of (la) derives unquestionably from the 
fact that it lends itself to calculations, transformations and modifica- 
tions in which the relativistic invariance is never lost sight of. How- 
ever, (la) became a milestone in another way also : it marks the 
renewal of the attempts to use the Born series both for the actual 
calculation of scattering and collision amplitudes, and also for the 
derivation of general theorems concerning these amplitudes. Prior 
to the establishment and detailed investigation of (la), it was com- 
monly believed that the first term of the Born series (the so-called 
Born approximation) may be useful for the calculation of cross 
sections in particularly simple cases, or that it may serve as a general 
orientation, that the use of the whole series would be, however, 
impracticable and may also yield incorrect results. 


There is one particular transformation of (la) which, though 
perhaps not of conceptual significance, is sufficiently important to be 
mentioned at this point. It represents a return from (la) to a form 
more closely resembling the Born series. Since 


iH ot/h -iH o/h 


V(t) =e (2) 
the matrix element of V(t) between two states U; = fa(E) U(E, «)dE 
and f= fa'(E’) VE’, «')dE’ (where « refers to the quantum numbers 
which are needed, in addition to the energy, to characterize the state 
completely), becomes 
<u| VQ) [by > 


iE’ s/h ~iEWh 
= fe a(E)*e a(B)\dEdE’ < E',«'|V|E, a>. (3) 
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Hence, the integration over f can be carried out and gives for the 
n= |] term of (la) 
B; = (2zx/i) f S(E — E’)a’(E’)* a(E)dE'dE < E', «'|V|E, x>. (3a) 
This is the first significant term of the Born series, the so-called 
Born approximation. Similarly, the integrations over all the variables 
l, -..-, În can be carried out in the nth term and one obtains for 
this term the limit e > 0 of 
Ba = 2zi Ur [dE,...dEn_ı dEdE' a'(E')*a(E) ò (E—E') 
Oi ! 1 
<E, a | V | Ejay > 
<Ea |V |E222> <E,a2|V|E303> <En_)4n_1|V|Ea> (4) 
E; — E — iz E,—E—ie ' Eng —E—is — 


The expression <;|S—1|ls> = EBn is called the Lippman- 
Schwinger series. The individual terms B„ for which (4) gives 
a convergent result are relativistically invariant if V is the interac- 
tion of a relativistic field theory, but this is not evident any more 
in the form (4). 


Heisenberg’s suggestion to calculate the S matrix without consider- 
ing the process of collision in detail was realized more nearly by 
the dispersion theoretic approach. This was considered, in some 
detail, at last year’s conference, Goldberger and Mandelstam having 
been the main reporters. Since that time, we have learned a great 
deal more about the Regge poles, which are also an outgrowth of 
dispersion theory. The subject has been summarized so often that 
I feel a bit embarrassed to give yet another review. 


The most symmetric coordinates for the description of a collision 
are the four-momenta of the colliding particles and the negative four- 
momenta of the particles resulting from the collision. In collisions 
of two particles which result again in two particles, there are four 
such momenta which may be denoted by pi, P2, P3, Pa. The time 
components of two of these are positive, those of the other two 
negative. The sum of the four-momenta 


Ip; = 0 (5) 


because of the conservation laws for linear momentum and energy. 
Because of its relativistic invariance, S depends only on the invariants 
which can be formed from the p;. Because of (5) and 


p? = m:?, (6) 
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where m; is the mass of the particle i, there are two independent 
invariants. (The square of a four-vector will always denote its 
Lorentz product with itself.) The most symmetric set of invariants 
are the Lorentz-lengths of the four vectors which contain two of 
the pi with positive, two with negative, sign. It is customary to write 


[((p1 + P2 — P3 — pa/2 = (i + Po)? = (P3 + p4?=s (6a) 
(Pi — P2 + P3 — P4)/2} = (Pı + D3)? = (P2 + pa)?= t (6b) 
[(Pi — P2 — P3 + PAP = (P2 + P3} = (pı + p4)?= u. (6c) 


The three invariants s, t, u are not independent but obey the 
identity | 
s+tit+tu= im? (7) 


The s, t, u are the variables used most conveniently to formulate 
the dispersion relations. If the p; are the four four-vectors of an, 
actual collision, one of the variables s, t, u is positive, the two others 
negative. If pı and p, are the four-momenta of the colliding particles, 
s is the positive one and gives the square of the energy in the center 
of mass coordinate system (p,° + p2°)*; t and u are negative squares 
of differences between final and initial momenta so that one of 
these is the square of the “ momentum transfer ”. 


In order to formulate the dispersion relations, the definition 
domain of S(s, t, u), in which s, t, u are real and obey the preceding 
equations, 1s extended to all complex values of its variables. The 
dispersion relations are then postulates concerning the region of 
analyticity of S and its behavior as s, t or u go to infinity. In addi- 
tion, S must be unitary for physically possible s, t, u, i.e., for real 
s, t, u which satisfy the aforementioned requirements. It is surmised 
that these postulates, coupled with some information concerning 
the low or high energy (s, t, or u) behavior of S suffice to determine S 
completely. 


Since we are interested principally in the low energy (that is, 
non-relativistic) behavior of the scattering cross sections, it is not 
necessary to use the full apparatus of dispersion theory and Man- 
delstam representation to obtain significant results. The variables 
commonly used for describing low energy scattering are the wave 
number k, giving the kinetic energy %?k?/2u in the center of mass 
system (u is the reduced mass of the colliding particles) and the 
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momentum transfer p, giving the scattering angle$ by p = 2kA sin 9/2. 
Hence, one can write 


>> > > . 
S(k,k) = &(k — k + (E — E^S(k, p). (8) 


The postulate of the dispersion theory is, then, that for fixed, real p, 
the scattering function S(k, p) is analytic in the upper half plane of 
the complex variable k, except that it can have poles on the imaginary 
axis corresponding to possible bound states. Further, S(k, p) tends 
to the same value S,(p) no matter how k tends to infinity in the 
upper half plane. The limit, S,(p), is given by the first term of the 
Born series. One has, further, 


S(k, p) = S(k, — p) = S(—k*, p*)*. (8a) 


If there are no bound states, it follows from the behavior of S in 
the upper half plane that 


S(k', p) 
k' —k 


l 
= Sip) + > 6 Sk + x, p) — Sk — x, p)dx|x 


S(k, p) = Sip) + =P f dk (9) 


where Pf denotes a principal value integral which is to be extended 
from — œ to œ. Its meaning is evident from the second line of (9). 
Together with the requirement that S be unitary, (9) suffices to 
determine S(k, p) if Sı(p) is known. Gell-Mann has given a method 
to construct S(k, p) from Sı(p) on the basis of (9) and the unitary 
condition. (8? 


Actually, (9) has been proved only for two particular types of 
potentials : (a) those which decrease with increasing distance faster 
than any exponential 9), and (b) for a sum of Yukawa potentials 
with arbitrary ranges and strengths."10) It is known that (9) is not 
valid for certain other potentials."1) It should be possible, how- 
ever, to approximate any reasonable potential by a potential of the 
type (a) or (b) so that (9) should be of practical usefulness. To my 
knowledge, however, no practical use has been made of (9) in actual 
calculations of atomic collisions (12) so that we lack practical experience 
in the use of this relation. 


The third important development of a conceptual nature is Regge’s 
theory of complex angular momentum. The relation of this theory 
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to the Mandelstam representation is not yet quite clear and, for 
reasons which will become evident, I wish to postpone its discussion 
to the next section. This section will deal with a semi-phenomeno- 
logical theory in which I was very greatly interested some years ago 
and on which I wish to relate a few new results. Some of these are 
elaborations of Regge’s theory. 


II. R-MATRIX THEORY 


R-matrix theory (13° describes collisions in configuration space. 
It is necessary to subdivise this space into two regions: the so-called 
inner and external regions. In the latter, the colliding particles or 
the separating reaction products are either not interacting with each 
other any more, or their interaction can be described by a potential. 
The wave function and its normal derivative can be developed on 
the surface dividing internal and external regions into a set of ortho- 
gonal functions We; the expansion coefficients will be denoted by 
ve and de. The R matrix, which is a function of the energy, gives 
the linear relation between these sets of quantities 


vc = 2 Ree de . (10) 


The orthogonal set Ve is so chosen that it describes a definite pair c 
of colliding or separating particles. As a result, the wave function 
in the external region becomes 


Y = > re [acl e(re) =— b.Ocre)]Ve (1 1) 


where I. and Oe = Ie* are incoming and outgoing waves in the 
“ channel ’’ c; they depend on the energy and on the separation re 
of the pair c. The factor re`! is introduced so that the condition 
that the radial functions I. and Oc carry unit flux have the simple 
form 

d O. — O. a en 2ime 


irre ee 


(lla) 


where m. is the reduced mass of the pair c. The first essential equa- 
tion of the theory is then the expression for the collision matrix 


S = (I — ’'R)(O — O’R)-! (12) 


valid in this simple form if the surface between internal and external 
regions is so located that the wave function already vanishes at 
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this surface in closed channels.’ The I, I’, O, O’ are diagonal 
matrices; the diagonal elements of I are I.(a.) where a. is the separ- 
ation of the pair c at the surface dividing internal and external 
regions and I, O = I*, O' = [’* are similar matrices with diagonal 
elements dI/dre, O, dOldr., all at re = ae. The second essential 
equation of the theory is the expression for the elements of the 
R matrix 


Ree = > Ee (12a) 


where the y,. and E, are energy independent constants, the so- 
called reduced widths and resonance energies. 


R matrix theory has two principal weaknesses. The first of these 
is the necessity to introduce a largely arbitrary dividing surface 
between internal and external regions. Since the constants y,_ and 
E, depend to some extent on the location of this surface, they are 
not truly characteristic of the collision system. This dependence 
of the y,, and E, on the surface is, naturally, compensated by the 
dependence of the I, I’, O, O’ in (12) on the ae since S does not 
depend on the surface. The second weakness became evident when 
the dispersion relations were developed. These are sharper condi- 
tions on the analyticity of S than can be obtained from R matrix 
theory.15° The reason is that the collisions are described, in dis- 
persion theory, by energy (or wave number k) and momentum 
transfer p, whereas R matrix theory is formulated in terms of energy 
(or k) and angular momentum. The validity of dispersion theoretic 
relations made it evident that there must be interrelations between 
the E, and y,. for various angular momenta. However, prior to 
Regge’s work, there appeared no means to discover such inter- 
relations. 


As to the first problem, the results of the arbitrariness of the 
surface dividing internal and external regions, there is no complete 
answer. Several attempts 6) have been made to eliminate this 
arbitrariness, but they have achieved only partial success. However, 
the investigation of this problem has led to a precise formulation of 
the so-called sum rules 7 which appeared only as an approximate 
relation in earlier calculations 


(YT ret! + Ye ne Ye ie 72 ' rı 
> > a ag E a Och Ice . (13) 
A ÆA j È l 
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The detailed derivation of (13) will be published elsewhere. It shows 
a certain similarity to the Born-Heisenberg-Jordan formulation of 
the Reiche-Thomas-Kuhn sum rules inasmuch as the sum on the 
left is conditionally convergent and contains oppositely equal terms. 
Hence, if the order of summation is reversed, i.e., if one sums first 
over A, then over u, the result is oppositely equal to the right side 
of (13). Similarly, the commutation relation pg — qp = %/i, which 
was derived from the optical sum rule, gives a non-zero (in fact 
infinite) value to the trace of a commutator which also contains 
pairs of oppositely equal terms. The point which is remarkable 
about (13) is that it is valid for all locations of the aforementioned 
surface in configuration space. It may be worthwhile to repeat 
here the equations which give the dependence of the constants E,, 
‘yn, on the location of the dividing surface, as characterized by 
the a.. These are 


dE; 2me 
dae 7’ 


(Ve — Env nc? (13a) 


where Ve is the potential energy acting between the pair c at r = ae 
and 


Yet c we! 
= i (13b) 


ue rn 2 2 Me >. EE 
— Fy Yaeda? + zy AE, — Ve) 


da 
c ter 


The second point which I wish to mention is a generalization of 
the R matrix formalism which appears to be relevant also for 
Regge’s theory.18 This will be sketched only for the one-dimen- 
sional case, i.e., for pure scattering. Let us consider the double 
characteristic value problem 

7 

(— iu z a v)s = Ey2y + Lw2y. (14) 
In the application of the method which one has in mind, v = 1, 
w= l/r, L = — (f42/2m)1(1+ 1). The only relevant limitation of 
the development which follows from the mathematical point of view 
is that E and L are the coefficients of positive definite self-adjoint 
Operators; | is a function of r but depends also on E and L. The 
essential point is that, if we again define R by (10), that is, in the 
present case as the reciprocal logarithmic derivative of ) at r =a, 
the expansion (12a) remains valid for E (even though its coefficient 
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is not 1 but v2) and a similar expansion is valid in terms of L. In 
order to obtain this result, one has to modify the derivation of (12a) 
but slightly. One defines X, by the equation 


p] 


12 d ‘ 

(— = 5 — V—Lw®)x, = E,y°X, (14a) 
and the boundary condition (d/dr)X, =0 which is supposed to 
leave v and w self-adjoint. The vX, are then the characteristic 
functions of a self-adjoint operator «-1Qv-1 where Q is the operator 
on the left of (14a) which is, however, a function of L. If this has 
only a discrete spectrum, any function, and in particular vý, can be 
expanded in terms of the vX,. If v~!Qu-! has also a continuous 
spectrum — this is the case if L > %2/8m in the example mentioned 
after (14) — an integral has to be added to the usual linear sum of 
characteristic functions. However, we shall disregard here this 
possibility. The expansion coefficients b, of the vX, in the expansion 
of vb then can be obtained by the usual manipulation of (14) 
and (14a) and become 


> 
msn 4 2d | 
b = (=) EEL Sema L)|2dr ] — Ya, L). (15) 


The factor in the brackets is necessary because we have not normalized 
the X,. The y}? are the reduced widths of the usual theory; they 
are given by 


A2 1/2 X, (a, L) 
AD = (oa) TR Dea (5a) 
Hence, v (LP 
y. (L) 
RE, D= Deu (16) 
N A 


so that, for real L, the R(E,L) is an R function of E. Similarly, 
p (EY 
x 
R(E, L) = > —— 
(E, L) 256 — (17) 


and R(E, L) is also an R function of L for real E. Furthermore, 
since, when deriving (17) one uses the same X, as when (16) was 
derived, 


E [LE] =E or LED] = L (18) 
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for all E and L respectively. Similarly 
yy (L)[f]oXx(r, L)|2dr]/2 = B (E (LIS | wX (r, L)|2dr]!/2 (18a) 


and there is a similar relation in which E and L are interchanged 

Equation (18) simply means that the poles of R occur for the 
same E, L combinations in both expansions (16) and (17). Similarly, 
(18) gives the relation between the residues of R, depending on 
whether they are considered as functions of E or L. There are some 
further interesting properties of E,(L) and of L, (E) which will not 
be discussed, however. What one would like to obtain is, of course, 
a joint expansion of R in terms of E and L of which (16) and (17) 
are specializations. This does not seem to be quite easy, however. 
Such an expansion might well give the relation between the R(E) 
ior different L, the absence of which was mentioned as the second 
failing of R matrix theory. At any rate, the preceding discussion 
can be amplified considerably and shows that at least some of the 
properties of the complex angular momenta of Regge are shared 
by a great variety of other parameters. 

The question naturally arises concerning the minimum information, 
in two-particle scattering, which determines all the scattering cross 
sections. This question was solved, from one point of view, already 
by Levinson, Jost and Bargmann 9 who showed that if there are 
no bound states, the scattering cross section for any single angular 
momentum already determines the interaction potential. The 
situation is only slightly more complicated if there are bound states. 
Once the potential is determined, the scattering for any angular 
momentum can be obtained. The opposite question, that of obtain- 
ing the potential if the phase shifts for all angular momenta but 
only a single energy are given, was first raised and partially answered 
by Wheeler.20) The result recently obtained by Regge @!) in this 
connection is truly surprising: the scattering cross sections, for 
every value of the energy and angular momentum, are determined 
once the phase shifts for a single energy and any infinite arithmetical 
series of angular momenta are given. It is most unlikely that this 
follows from R matrix theory, even if it is extended in the way 
indicated. The reason is, probably, that R matrix theory remains 
valid if the interaction cannot be described by a potential but is a 
more general integral operator. Regge’s result is, most likely, con- 
ditioned on a potential type interaction, i.e., an interaction operator 
which commutes with the position operators. 
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III. METHODS AND APPLICATIONS 


By methods of collision theory one means approximation methods 
which then can be applied to specific problems. It is almost tauto- 
logical to say that no approximation method is perfect and it is 
usually not difficult to find a case in which it fails grossly. What 
is surprising, rather, is the wide applicability of approximation 
methods and the often almost uncanny instinct which some physicists 
have in the choice of these methods. 22? Some of the methods to 
be discussed were suggested by the mathematical structure of the 
basic theory, some were designed to meet particular conditions. 
There are three principal areas in which collision theory plays a 
decisive role : atomic collisions, nuclear collisions, and collisions 
between “elementary particles °. Methods have been developed 
to meet the conditions in each of these fields. These methods have 
found, however, applications also outside the area of their origin. 

The choice of approximation methods to be discussed in a short 
review may be as difficult as the choice of a method suited to a 
particular problem. I am well aware that my discussion will appear 
dilettantish to those who have worked with the method under dis- 
cussion. However, before proceeding with the discussion, it may be 
well to make another remark. 

As a result of the development and increasing use of computing 
techniques, the straight two-particle scattering problem, for a 
definite angular momentum, can be solved for any definite angular 
momentum with sufficient accuracy. This assumes, of course, that 
the interaction between the particles is known. Hence, the only 
significant two-particle problem which remains concerns the high 
angular momenta and there will be some remarks on this question. 
The other significant problems are, then: scattering of composite, 
particles and transmutations. It would seem logical, then, to devote 
the rest of this report to these questions. However, a detailed dis- 
cussion of these necessarily more specialized questions would render 
the remainder of this report too diffuse. Since, further, most of 
the points to be made can be well illustrated on simple two-body scat- 
tering, all examples have been taken from this most elementary field. 


(a) The Born approximation (5) 


This is the first term of the Born series, discussed before, and it 
is surprising how close it often comes to reality. The most noted 
case in point is, of course, the unrelativistic Coulomb scattering. 
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It should be noted, nevertheless, that the Born approximation 
often does give very inaccurate results. This will be the case, in 
general, if the actual wave function differs considerably from a plane 
wave. This will occur if the interaction is strong enough to distort 
the incident plane wave appreciably. In regions of configuration 
space in which the potential is strongly repulsive, the actual wave 
function will have a much smaller value than the plane wave. Hence, 
the scattering and reaction due to the interaction in these regions 
will be much smaller than calculated for an undistorted plane wave, 
that is, using the Born approximation. A strongly attractive potential 
affects the amplitude of the wave function less drastically and is 
less likely to lead to gross errors. Nevertheless, section (d) contains 
implicitly an example for this opposite behavior : it calculates the 
Born approximation for the scattering by an r~? potential in order 
to obtain the scattering by a potential which behaves as r-? at 
infinity but has no singularity at r = 0. This latter singularity does 
not affect the Born approximation for the higher angular momenta 
although it would affect the actual scattering. However, in practice, 
strongly repulsive potentials are much more likely to cause a failure 
of the Born approximation than attractive ones: they give a too- 
large cross section. In fact, the most common sign for the in- 
accuracy of the Born approximation is that it gives a very large 
cross section. | 


Whether or not the cross section is very large can be judged more 
easily if one uses the Born approximation for the spherical waves 
with definite angular momenta. In this case there is a natural 
measure, the maximum possible cross section 4rr(2/ + 1)/k2, with 
which to compare the approximate cross section. This is given by 


o = 4n(2l + 1)m212/R4k4 (19) 


k = J 2V0) Lii(kr)]2dr (20) 


the jı being kr times the spherical Bessel functions: j(kr) = sin kr, 
etc. One will expect the Born approximation to be valid or to fail, 
depending on whether or not 2mI/#2k is small as compared with 
One. However, this is not a safe criterion as it seems altogether 
not easy to give general criteria for the accuracy of the Born 
approximation. Thus, the scattering in a Coulomb field in a given 
direction is correctly given by the Born approximation even though 
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the integral for I diverges in this case. In this case, the total cross 
section is infinite and it is this circumstance that is reflected in (20) 
becoming infinite. On the other hand, in the case of a rapidly- 
decreasing potential (19) is surely grossly inaccurate if 2mI/A2k is 
of the order of 1 or larger. Conversely, if 2mI/%2k is small, one 
can reasonably assume that (19) is accurate. There is one well- 
known exception to this : a resonance is not reproduced by (19) 
so that (19) surely can give far too small cross sections over narrow 
energy intervals. | 

A further point worth remembering is that, for a potential which 
is attractive in some, repulsive in other, regions, I, and hence the 
Born cross section, may well vanish for a definite k. This is a 
consequence of the real nature of I and is a rather general occurrence 
in Born approximation. The actual cross section, however, vanishes 
only under exceptional circumstances @3’ and both theoreticians 
and experimental physicists have learned to view with suspicion 
the zeros in the cross sections, or in the angular distributions, 
predicted by the Born approximation and some of its generalizations. 

It hardly needs to be repeated that the preceding discussion, 
restricted as it was to the simplest possible case, does not do justice 
to the immense area of applicability of the Born approximation. 
In fact, the principal value of Born’s method may well lie in the 
ease with which it gives, for almost every collision problem, at least 
some guidance on what to expect. This is only indicated by the 
simplicity of the formula (19) which, however, could well serve to 
illustrate its strengths as well as its weaknesses. 


(b) Distorted wave Born method (24) 


The Born approximation becomes inaccurate if the potential 
modifies the wave function considerably. Thus, in the hard sphere 
case (V = œ for r <a) the Born approximation gives an infinite 
cross section even if a is small, so that the actual cross section is 
quite moderate. The reason is evident: the calculation assumes 
that the incident wave penetrates to r< a and is very strongly 
scattered there. Evidently, the accuracy of the method could be 
improved considerably if at least the bulk of the effect of the poten- 
tial could be taken into account to begin with. This is the idea of 
the distorted wave Born approximation. 

The distorted wave method can be used to calculate scattering; 
more commonly it is used to calculate reaction cross sections. The 
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scattering amplitude consists of two parts : the one inherent in 
the distortion, and the other due to the part of the interaction which 
is not taken into account by the distortion. Thus, in the afore- 
mentioned case of a hard sphere, the /= 0 part of the distorted 
wave would be sin k(r — a) for r> a, and this would completely 
describe the effect of the hard sphere. However, if the potential 
extended beyond the r = a point, this could be taken into account 
approximately by the first term of the power series development of 
the solution of the wave equation, in terms of the residual potential. 

The most important areas of applications of the distorted wave 
method are, however, problems of reaction and transmutation. 
When calculating these, it is important not to permit the incident 
wave to have a considerable amplitude in regions of configuration 
space where the interaction causing the reaction is strong but where 
the incident wave can hardly penetrate. If one wants to preserve 
symmetry between initial and final state (and hence preserve the 
principle of detailed balance), not only the wave function of the 
colliding particles but also that of the reaction products must be 
distorted, that is at least crudely adjusted to the prevailing potential. 

The method as described in the preceding paragraph assumes 
that the interaction can be separated into a part which modifies 
the wave function of the colliding particles (and may have a similar 
effect on the wave function of the reaction products) but does not 
affect their identity, and another part which is responsible for the 
reaction or transmutation processes. The simplest way to obtain 
an expression for the first part of the interaction is to calculate its 
expectation value for the wave function of the colliding particles 
(or of the reaction products) as function of their distance and use 
this expectation value as distorting potential. The starting point 
for the Born series is, then, the scattering wave function in this 
potential. From what was said about the hard sphere case it should 
be evident that this will give a considerable improvement over the 
undistorted wave method and will be, in fact, a good approximation 
if the second part of the interaction, leading to reactions, does not 
modify the wave function seriously. 

If the wave function is considerably modified by the part of the 
interaction which leads to reaction or transmutation, it may be 
Possible to bar the incident particles from the region which they 
cannot reach, because they are transmuted before, by introducing 
an absorptive (complex) potential. This is a rather customary 
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procedure in nuclear physics and has proved very fruitful.25° How- 
ever, no motivation from first principles seems to be available. 26 
It is clear, in particular, that there must be a consistency condition 
between the magnitude of the absorptive part of the distorting 
potential and the result of the subsequent calculation of reaction 
probability. Similarly, in order to preserve the principle of detailed 
balance, the reaction products should also move in a complex 
potential. Consistency conditions of this nature have been applied 
in the past only sparingly and it is even customary to treat the 
distorting potential as an adjustable quantity. This is justifiable if 
One wishes to obtain a first orientation, but not in a definitive 
theory. | 

The distorted wave Born approximation has played an important 
role in the theory of atomic collisions much before its recent use for 
nuclear phenomena. In particular, Zener’s calculation 27’ of the 
excitation of vibrations in molecules by atomic impact is based on 
this method, as are also various modifications and refinements of 
his work. In this case, the method seems to give satisfactory results 
which is, unquestionably, at least in part due to the fact that the 
cross sections are small, i.e., the transfer of translational energy 
to vibrations is often a notoriously slow process. However, the 
distorted wave Born method has been used also to calculate the 
excitation of rotational degrees of freedom and applied also to some 
other processes to which it is, probably, less applicable. It is almost 
as easily handled as the straight Born.approximation and is un- 
questionably more accurate. The report of O.K. Rice reviews these 
applications. 


(c) Classical path, quantum interaction 


This is, in fact, a variant of the previous method, based on the 
fact that in many cases the motion of the colliding bodies can be 
better described by classical theory than by a crude approximation 
to the solution of the distorted wave equation. Recently, this 
method, originally devised for describing chemical processes (28) 
has been applied with great success to transmutations occurring in 
the course of heavy ion collisions. 29 

If the paths of two atoms cross at a reasonable distance and with 
sufficient relative velocity so that the energy transfer cannot be 
expected to affect the paths substantially, the paths can be calculated 
by means of classical theory. with reasonable accuracy. One then 
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obtains the interaction operator affecting the internal coordinates 
of the two atoms as function of their position or, on the basis of 
the preceding calculation, as function of time. The transitions 
caused by this time-dependent interaction can then be calculated by 
means of first order perturbation theory. 


The method is particularly adapted to calculate electronic excit- 
ations or vibrational or rotational transitions. The use of classical 
theory for the calculation of orbits is unquestionable if the impact 
parameter is so large that the orbits could be traced out by the 
motion of wave packets. In order to have appreciable transition 
probabilities in spite of large impact parameters, the interaction 
energy must decrease slowly with increasing distance of the particles. 
This condition is satisfied most accurately if one of the particles is 
charged and its field, varying at the site of the other particle, induces 
transitions therein. The calculation of this effect is probably the 
most important application of the method in the theory of heavy 
ion collisions. However, it can be expected also to give a good 
description of the collision excitation of molecules with dipole 
moments. 


(d) The WKB method 


This method is intermediate between the distorted wave Born 
approximation and the preceding method, inasmuch as it takes the 
quantum character of the motion of the colliding particles at least 
approximately into account. The principles of the method, as 
usually applied 60, are probably too well known to need further 
comment. There is a number of subtle points in connection with 
the replacement of I(l + 1) by (l + 1/2)? and the behavior of the 
wave function near the classical turning point. I would like to call 
attention, however, to a modification of this method, using the semi- 
classical distribution function.9! It is possible, apparently, to avoid 
some of the ambiguities in this way. However, as far as known to 
me, the method has been applied so far only to two-body scattering. 


There is a remarkable difference between the classical and the 
quantum theory of scattering which has not been much emphasized, 
Perhaps because it has no obvious visualization. In classical theory, 
the cross section is always infinite if the forces have an infinite range, 
no matter how fast they go to zero with increasing distance. The 
reason is that two particles, no matter at what distance they pass by 
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each other, suffer some deflection. Hence, the cross section is always 
infinite for very small scattering angles, and it is often infinite 
also 32) for a scattering angle =. In quantum mechanics this is not 
the case; if the potential drops fast enough, the total cross section 
remains finite; for an even faster drop, even the differential scattering 
cross section at very small angles is finite. It may be of some interest 
to ascertain how fast the potential must drop so that these conditions 
are satisfied. 


The total cross section 1s 


dr 
= 2 
l 


(2! + 1) sin? (21) 


Gt 


where nı is the phase shift of the partial wave with angular 
momentum /#. The forward scattering cross section is 


a0) = 252 | (2! ~ 1X(e 
l 


2i7, 
a 


2 
1)|. (22) 


Hence, since at any rate yı > 0 for large /, the total cross section 
will be finite if the series & (27 — 1)nı? converges, the zero angle 
differential cross section will be finite if E (2/ + 1)y: converges. 


It is not difficult to show that, for large /, the Born approximation 
gives a correct phase shift. This follows essentially from the 
magnitude of the Born scattering cross section. 


o = 4r(m2/h4k4)\(21 + 1)? | (23) 
where I; is given by (20). Hence, one will write 
sin ni = — (m/k). (24) 


If this is very small, the condition for the validity of the Born 
approximation is satisfied and one can prove its actual validity also 
more rigorously. We shall use a potential r-? because the calcul- 
ation is easiest for this potential. Actually, this potential gives 
infinite scattering, because of its singularity at r = 0, but it can be 
shown that I would be asymptotically the same for all potentials 
which become r~? at large distances. Hence, we calculate 33) 


por Ja z(p — 2)! @ — p/2 + 1/2)! 
o r? 2P[(p/2 — 1)! PU + p/2 — 1/2)! 
T/P—? L 1)-p+l 
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Hence, the total cross section will be finite if the potential decreases 
faster than r-2; the differential cross section at zero angle will be 
finite if it decreases faster than r-3. It is interesting that, in the 
present case, the Born cross section is essentially correct where it 
is largest, namely at zero angle. Actually, for a potential which 
goes to zero as r-?, the differential cross section at a very small 
scattering angle % is proportional to 92? -6 as long as 1 <p < 3. 


(e) Variational principles 


The great accuracy of the variational calculation of discrete energy 
levels, particularly that of the normal state, suggests the use of this 
principle also for collision processes. The early investigations of 
Hulthén 34) and of Tamm (5? were concerned only with scattering 
but Lippmann and Schwinger (36) extended the method to arbitrary 
collision processes. 


There is at least one great difference between the application of 
the variational principle to the calculation of the energy of the 
normal state and its application to obtain scattering cross sections: 
the sign of the error which is positive in the former case remains 
unknown in the latter. As a result, in the case of collisions it is, 
as a rule, not possible to tell which of the trial functions gave the 
more accurate result. This impairs the usefulness of the method 
to some extent : its effectiveness is so striking in many cases that it 
is surprising that it is not used more generally. 


The problem is to find an expression for the phase shift, or the 
collision matrix element, or something equivalent, in terms of a 
trial wave function suggested by simplicity or physical intuition. 
This expression should be, furthermore, insensitive to deviations of 
the trial wave function from the accurate one. More precisely, it 
is demanded that the error in the wave function enter only quadra- 
tically into the error in the phase shift or collision matrix element. 
This can be always accomplished by adding the first correction to 
the original value of the physical quantity in question. The remain- 
ing error will then be of second order. Let us consider a simple 
Scattering problem 


(H — E)} = (Ram) Vy — Ed = 0 (26) 
and assume that | assumes, for large r, the form sin kr + c cos kr. 
The constant c = tan y can then be regarded as the tangent of an 
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angle n which is called the phase shift because, for large r, the wave 
function becomes proportional to sin (kr +). Similarly, let us 
consider a trial wave function U; which we hope is not too different 
from J. At large r, it should approach the form sin kr + c; cos kr, 
corresponding to a trial phase shift ns for which c: = tan ne. If 
ve is inserted into (26), the right side will not vanish; we denote 
it by e: 

(H — El: = — (2/2m)be" + Vu — Ede = er (26a) 


One then easily derives the equation 
oO 
(A2/2m)k(c — ce) + f verdr = 0. (27) 
o 


The integral converges because U; is a solution of (26) for large r 
so that e, goes to zero. One can give (27) the form 


(e) je 6) 
(#2/2m)(ktann — k tan yz) + i pedr = Í (be — V)erdr. (27a) 


Both integrals converge. The expression on the right side is indeed 
second order in the deviation of U: from yọ so that it can be assumed 
to be small. 


If one believes that the trial wave function is already reasonably 
close to the true wave function, (27a) gives directly 


CoO 
tan y = tan ne — (2m/h2k) f deerdr (27b) 
o 


this being correct to the second order in the deviation of 4; from Y. 
This is the simplest and most common way to use the variational 
expression (27a). 


If one is not ready to assume that a trial wave function of 
sufficient accuracy can be guessed, one can use an analysis which 
shows a certain similarity to the application of the variational 
principle to the determination of the energy of stationary states. 
Nevertheless, there are significant differences which render the 
variational principle for scattering less effective than for the deter- 
mination of the energy of bound states. The principal reason for 
this is that the latter principle represents, at Jeast for the norm 
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state, a minimum principle whereas the sign of the right side 
of (27a) can be positive as well as negative. Similar to the trial wave 
function for bound states, one assumes for the scattering problem 
also a trial function of considerable flexibility 


N 


U(r) = sin kr — a f(r) cos kr + > fatal?) (28) 
l 


where f(r) is some function, such as | — e-%, which vanishes at 
r = 0 and approaches | at large r. The u„(r) also vanish at r = 0 
so that Wẹ satisfies the boundary condition (0) = 0. At large r, 
the u„(r) approach zero since these functions are square integrable 
and the asymptotic behavior of b: becomes sin kr + c cos kr. 
The functional form of f(r) and the un, as well as their number N, 
are dictated by physical intuition; the u„ are introduced to permit 
the trial function to approach the real F also for small r. 


If U(r) happens to be the true 4, both sides of (27a) will vanish. 
If the ta and ng differ from the values which give the true | by small 
amounts, the two sides of (27a) will differ from zero by an expression 
which is second order in „— v: and the deviations of the t, from 
their real values. It is reasonable, therefore, to determine the tn 
and n by the condition that 


L = (A2/2m)k a — | drerdr (29) 
Ö 


be an extremum. This will, indeed, lead to correct equations when- 
ever U(r) can be expressed in the form (28) with suitably chosen 
in and ve. 


If one calculates the integral in (29), one obtains terms which 
are second order in the c; and tn, something like acı? + c:Lantn 
+ Zanmtntm. In addition, however, one also obtains linear terms 
in cz; and the fn, coming from the sin kr term in (28), and even a 
term which is independent of the c; and tn.’ Hence, if one sets the 
derivative of (29) with respect to the cs and tn equal to zero, one 
obtains N + 1 linear inhomogeneous equations for the c; and tn 
which have, in general, a unique solution. This is different from 
the case of the application of the variational principle to the discrete 
spectrum, that is to bound stationary states. This leads to homo- 
geneous linear equations and a secular equation for E to assure the 
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compatibility of these homogeneous equations. In the present case, 
of course, the energy need not be adjusted. Once the ft, and c: are 
obtained, the corresponding value of tann =c can be obtained 
by evaluating L and using (27) 


tan n = c = (2mjisk)L. (30) 


Again, this is different from the stationary case for which the energy 
is obtained as f W: Ht. The calculation of the parameters (c: and 
tn in the scattering case) is in principle simpler for the scattering 
than for the stationary case. On the other hand, whereas one can 
show by the minimum principle that the addition of further terms 
to the trial wave function of the stationary case always improves 
the result (lowers the energy), the addition of further terms 
In +1Un,+ ı(r), -- may render the value (30) for ') less accurate. 
One can, of course, say that the earlier, better value was more 
accurate fortuitously, but the fact is that it is difficult to judge the 
accuracy of the n, obtained by the variational method, unless e: = 0 
in which case, of course, 7 = yz and the trial wave function was 
accurate. 


There are other variational methods 37’ which permit an estimate 
of the error in . These are based on the calculation of the expression 


JIH — E)yr]?ar (31) 


which, of course, does obey a minimum principle. However, the 
calculation of this expression seems to be very cumbersome, just 
as in the stationary case. Under certain conditions (e.g., at very 
low energy), the L of (29) also obeys a minimum principle.38’ In 
such a case, it should be on a par with the variational method for 
the energy of stationary states. However, no systematic theory 
could be developed so far to obtain general variational expressions 
of this nature. 


Kato, and also Spruch and Kelly 37) observed that the phase 
shift increases if the potential is decreased (or made more negative). 
This suggests the use of a method similar to Bazley’s 3% to obtain 
upper and lower limits for the phase shift, by approximating V, 
both from above and below, by potentials for which the scattering 
problem can be solved accurately. Unfortunately, there are few 
such potentials. 


232 


244 Part I. Physical Chemistry 


The variational method, as all other methods, was discussed in 
its application to simple scattering. However, most of the preceding 
considerations can be extended also to problems of reaction and 
energy transfer. Unfortunately, one often has the impression that 
the variational principle is used more often to demonstrate its power 
than to solve new problems. It is surprising how often (27b) gives 
a close value for v, even for a very simple ys. 


THE IMPULSE APPROXIMATION (4°) 


Dr. Karplus drew attention during the meeting to the impulse 
approximation. This approximation is particularly simple to apply 
to the collision of a single particle with a system of other particles, 
the latter forming a bound state. It was originally devised to describe 
the collision of a single nucleon with a complex nucleus. It could 
be used also to describe the collision of an electron, or of a proton, 
or perhaps even of an atom, with an atom or a molecule. 

The impulse approximation is somewhat similar to the Born 
approximation inasmuch as it is particularly useful at high energies. 
There are n terms in the potential energy between an incident 
particle and a complex containing n particles. These correspond to 
the interactions of the incident particle with each of the particles 
of the complex. Each term in the potential energy gives rise to a 
term in the scattering amplitude of the Born approximation. This 
is true also of the impulse approximation, the only difference being 
that the terms of the Born approximation are approximate expres- 
sions for the scattering by a single particle whereas the impulse 
approximation uses the accurate expressions for these quantities. 
However in both approximations the total scattering amplitude is 
simply a sum of the scattering amplitudes due to the particles of the 
complex. One reason for this not giving the accurate value is that 
the incident particle can be scattered by two or more particles in 
succession and the second and higher members of the Born series 
indeed contain terms which correspond to such events. Another 
reason is that the particle which scattered the incident particle will 
continue to interact with the other particles of the complex also 
after having collided with the incident particle, and the impulse 
approximation does not take this interaction into account. In fact 
the properties of the bound state of the complex enter the calcul- 
ation only by determining the momentum distribution of the 
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constituents. The total scattering amplitude is the superposition of 
the scattering amplitudes which correspond to the momentum 
spectra of the constituents. 


It follows from the preceding description of the impulse approxim- 
ation that it can be used only if the scattering of the incident 
particle by the individual constituents of the complex is already 
known. However, if this is the case, it can be used just about as 
easily as the Born approximation. Unfortunately, no very detailed 
and conclusive discussion of the impulse approximation is available 
but the consensus is that it gives useful results only if the energy 
of the incident particle is much higher than the binding energies of 
the constituents of the complex. Only if this is the case, can the 
scattering by bound particles be treated as if they were free; as was 
mentioned before, the bound nature of the scattering particles 
manifests itself only by their having a spectrum of momenta, rather 
than a single momentum. 


The impulse approximation has not played a significant role so 
far in the theories of atomic or molecular collisions. 
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Discussion du rapport de M. Wigner 


M. Karplus. — I should like to emphasize Prof. Wigner’s 
reference to the way properties of the bound state are treated in the 
impulse approximation. Because of the binding, the single particle 
in the bound system which acts as target is not in a state of known 
linear momentum (at rest, for example). Instead it is in a super- 
position of momentum states, a superposition which is described 
by the fourier transform of the target particle’s spatial wave function. 
The total scattering amplitude from this one target particle, then, 
is obtained by superposing the amplitudes which correspond to 
the various momenta in the momentum spectrum of the target 
particle, each weighted by the amplitude for occurrence of this 
momentum. The total amplitude for scattering by the bound system 
is finally obtained by adding the impulse approximation amplitudes 
for each of the particles in the system considered as target. As 
Prof. Wigner has mentioned, many other important effects of bind- 
ing are neglected. 


The impulse approximation in nuclear physics has been parti- 
cularly useful when the momentum transferred to one target 
particle in the collision is large compared to the momenta in the 
initial distribution. In atomic and molecular problems this relation- 
Ship usually does not hold. 


M. Shuler. — Prof. Wigner has just given us a beautiful and 
lucid talk on scattering theory. As he has pointed out, all the success- 
ful applications are in the fields of quantum electrodynamics, 
nuclear physics and electron-atom (or molecule) scattering. As 
far as I know from my own work and from the literature, there 
have been essentially no successful applications of modern quantum 
Scattering theory to inelastic molecular collisions such as, for 
instance, the dissociation reaction 


AB+M+>A+B+M. 
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I wonder if one of the practical problems in the calculation of 
inelastic atomic and molecular scattering is not the absence of a 
convenient and reasonable expansion parameter. In most of the 
successful applications of scattering theory, such as scattering of 
electrons or photons from atoms or molecules, there is such an 
expansion parameter — either the mass ratio of the scattering 
centers (me/Matom) or an energy ratio. I would appreciate to hear 
Prof. Wigner’s views on that. 


M. Wigner. — Perhaps in my desire to report on the general 
aspects on collision theory, I have underemphasized the theoretical 
treatment of phenomena of particular interest to our conference. 
As for these problems, interesting results have been obtained not 
only on the theory of the excitation of vibrational motion but also 
on the electronic excitation by atomic collisions and a number of 
other phenomena which I am sure will be discussed in the course 
of our conference. 


As to the main point made by Dr. Shuler, I agree fully. Except 
for the very simplest systems, all successful approximation methods 
are based on the circumstance that some parameter or quantity is 
small. In addition to those enumerated by Dr. Shuler, I might call 
attention again to the case in which the only process of high probability 
is that of scattering. Furthermore, it may be possible to treat this 
accurately. All the other processes, such as vibrational or electronic 
transitions, are then unlikely and their probability is effectively 
Dr. Shuler’s small parameter. I realize that I have now restated 
the conditions for the validity of two of the approximation schemes 
which I discussed : the distorted wave Born approximation and the 
classical path, quantum interaction method. However, the validity 
of the other approximation methods, with the possible exception of 
that of the variational method, can also be shown to depend on 
the small value of some parameter. 


As to Dr. Shuler’s first question, it is true that the most recent 
theoretical treatment of the association reaction with which I am 
familiar, that of Keck, is not based on collision theory, but uses 
the transition state method. As a matter of fact, I consider it the 
main result of the transition state method that it shows that rate 
processes can be calculated without a detailed description of the 
collisions. This was so far possible only in classical, that is non- 
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quantum, theory and quantum effects could be taken into account 
only approximatelv. Theoretically, this is a serious limitation in 
our treatment of association reactions. In practice, the limitation 
is less significant because quantum effects are, as a rule, not very 
important in association reactions. 


M. Widom. — The Born series goes in inverse powers of 
Planck’s constant. One knows the practical usefulness, however, of 
expressions for the cross-section which go in increasing powers, 
the leading term of which is the classical theory. The question 
then arises, whether the analytical properties of such series might 
not lead to results which sumplement in a valuable way the conclusions 
based on the analytical properties of the Born series. 


M. Wigner. — Dr. Widom’s suggestion seems to me to be a 
very interesting one. As a matter of fact, some hint in the direction 
which he mentions is contained in the written report which I 
presented, even though I omitted this part in my verbal presentation. 
R2 is the coefficient of a positive definite operator in the Hamiltonian 
so that some of the remarks contained in the section on R matrix 
theory apply to it. However, this point has not been worked out 
to date. Further, lest too much significance be attributed to the 
considerations in my report to which I am referring, I wish to re- 
emphasize that the behavior of S in the complex plane at infinity, 
which plays such an important role from the point of view of the 
dispersion relations, does not follow from R matrix theory. 
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In: Perspectives in Quantum Theory; Essays in Honor of Alfred Lande 
(W. Yourgrau, A. van der Merwe, eds.). 
M.I.T. Press, Cambridge, MA 1971, pp. 25-36 


I. 
There seems to be a revival of interest in the formulation of quantum 
mechanics in terms of distribution functions, i.e. functions P(q, p) 
of positional and momentum coordinates q and p. The function P 
represents a quantity which is, in the classical limit, the phase- 
space distribution function; it gives the probability that the co- 
ordinates and momenta have the values g and p. The existence and 
properties of such distribution functions are closely related to the 
question of the possibility of the reformulation of quantum 
mechanics in terms of classical concepts and are, therefore, close 
to Lande’s interests. The exact definition of P when the quantum 
conditions, in particular the uncertainty relations, play a role is 
not clear. One wishes to define, for every normalized state vector 
W, a corresponding distribution function P. This is postulated to 
satisfy certain conditions which the classical phase-space distribution 
also satisfies, but these conditions do not determine P completely. 
The choice between the P which satisfy a set of specified conditions 
is made aiming at greatest mathematical simplicity and in such 
a way that the P chosen becomes a useful tool for carrying out 
calculations of quantities which are not easily obtainable otherwise. 
Two reasonable conditions which can be imposed on P are: (a) 
that it be a Hermitian form of (x), i.e. that the P which describes 
the same state as the wave function y be given by 


P(q, £) = (Ww, M(q, p)W), [4.1] 


where M is a self-adjoint operator depending on and g and (b) 
that P, if integrated over p, give the proper probabilities for the 
different values of q as 


f P(q, p) db = W(9)I?, [4.22 
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and, if integrated over g, give the proper probabilities for the 
momentum as 


f P(q, p) dq = (27h)! 


f br) is f [4.2b] 


All integrations, unless otherwise noted, are to be extended over 
the whole range of the variables, from — oo to +œ. It may be 
observed also that, even though the conditions which ? should 
satisfy are formulated above for a one-dimensional problem, they 
can be easily extended to an arbitrary, many-dimensional one by 
making g and p vectors with as many components as has the con- 
figuration space of the w for which P should give an alternate 
description. 

A somewhat milder form of condition (b) is that P should give 
the proper expectation values for all operators which are sums of a 
function of p and a function of q as 


f PE DL + earl aa = (ur) + sn] v). 
[4.2] 


It will suffice to use condition (b) in the form [4.2]. 
A third, very natural, condition on P would be that P is non- 
negative for all values of p and g: 


P(p,q) = 0. | [4.3] 


It has been stated in an article by the present writer where a ? 
satisfying conditions (a) and (b) was given! (actually for the many- 
dimensional case) that these conditions are incompatible with [4.3]. 
It has been variously suggested to the writer that this proof be 
published, and this will be the first subject of the present article. It 
is sufficient, for this purpose, to consider the one-dimensional case. 

In order to carry out the proofit will be shown that the assumption 
that a P satisfying all three conditions [4.1], [4.2], and [4.3] can 
be defined for every y leads to a contradiction. Actually, in order 
to obtain the contradiction, it will suffice to consider such yw which 
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are linear combinations ab, + bir, of any two fixed functions such 
that y, vanishes for all x for which y, is finite. We start with the 
following lemma: 


LEMMA 1 

If w(x) vanishes in an interval J, and if g(q) is zero outside this 
interval, and nowhere negative therein, one has for the P corres- 
ponding to the y above 


| P(q, pe(q) dg = 0 [4.4] 


for all p (except for a set of measure zero). This follows from [4.2] 
with f = 0: the integral of [4.4] with respect to p, 


Í Í P(q, P)e(q) ab dg = I, e(x)¥] = 0, [4.42] 


vanishes because the right side of [4.2] vanishes. However, the 
integrand with respect to p, that is the left side of [4.4], is non- 
negative for the g postulated as long as [4.3] holds for ?. It follows 
that the integrand with respect to p must vanish except for a set of 
p of measure zero. Since, furthermore, [4.4] is valid for every 
function g(g) which satisfies the conditions of Lemma 1, we can 
conclude in a similar way that 


LEMMA 2 

If w(x) vanishes in an interval J, the corresponding P (q, p) vanishes 

(except for a set of measure zero) for all values of g in that interval. 
Let us consider now two functions y, (x) and y,(x) which vanish 

outside of two nonoverlapping intervals /, and /,, respectively. 

Because of [4.1], the distribution function P(g, p) which cor- 

responds to ay, + by, will have the form 


Plg p) = lal?P, + a*bP,, + ab*P,, + |b|*P>. [4.5] 


Setting b = 0, we note that P, is the distribution function for wp, 
and similarly P, is the distribution function for y,. Let us consider 
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[4.5] for the q outside the interval J,. Since P, vanishes almost 
everywhere for such g, the distribution function [4.5] cannot be 
positive for all a and b unless both P, and P,, vanish (except for 
a set of measure zero in g and p) if g is outside J,. The same can be 
concluded as long as q is outside /,. Hence, we have, instead of 
[4.5], almost everywhere 


Pa(2) = lal Pilg $) + Ib P(g, b). [4.6] 


This means that the distribution function P, is (almost everywhere) 
independent of the complex phase of a/b. This seems, at least 
intuitively, absurd if P,, is to give the proper momentum distribu- 


tion, i.e. is to satisfy [4.2b]. 
In order to demonstrate this, let us denote the Fourier transforms 


of yw, and y, by g,(p) and @,(p). Equation [4.2b] then reads 


ja]? f P (4, p) dq + (01? f P,( p) dq 
= jal7|o,(p)|? + löl?le2(p)l? + 2Re ab*p,(p)o:(p)*. [4.7] 


Since this must be valid for all a, b, we must have identically in p 


Qy(p)p2(p)* = 0. [4.7a] 


This is, however, impossible since p, and @., being Fourier trans- 
forms of functions restricted to finite intervals, are analytic functions 
(in fact, entire functions) of their arguments and cannot vanish 
over any finite interval. This then completes the proof that no 
nonnegative distribution function can fulfill both postulates (a) 
and (b). 


Il. 

Since a bilinear distribution function which gives the proper 
probabilities for position and momentum (i.e. satisfies the two 
conditions of the preceding section) cannot be everywhere non- 
negative, this last requirement will be abandoned for the rest of 
this article.” The question then arises of what other natural re- 
quirements on the distribution function can be made and to what 
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degree these determine the correspondence between wave functions 

and distributions. When investigating this question, it will be con- 
venient to relax the mathematical rigor of the preceding section. 
In particular, the operator in (I) will be replaced by a kernel so 
that we will have 


P(4, $) = f f K(4, p; x, XYY (K) de di. [4.8] 


The kernel K will be permitted to be a singular function, though. 

The first and most natural requirement on the correspondence 
[4.8] is that it be Galilei invariant. This means, in one dimension, 
first, that it be displacement invariant 


P(g + a,b) = Í K(q, p; x, "bl + a)" (x! + a) de dx’. 


Substituting x and x’ for x + a and x’ + a on the right side, and 
considering that [4.8] must be valid for all a, one easily infers that 
K can depend, in addition to p, only on the differences q — x and 
q — x’, or on 2q — x — x’ and x — x’, not on all three quantities 
q, x, x’. This was to be expected. 

The second requirement of Galilei invariance is that if W(x) is 
replaced by w(x)e'?*", the new distribution function becomes 


P'(q, p) = Pip — £’) 
P'(4, $) = f K(q, p; x, x") (x) Fy (x ET dx dx 
= PP ~ #) = || Kap - ir AWA NE) de ar. 
[4.9] 
Since this must be valid for all Y, we have 
K(q, p; x, x) =Ë = K(g, p — p'; x, x’). 7-10] 
Substituting p’ = p, one finds 


K(q, p3 x, x’) = eX XIN K g, Osa) 
= PAKO I — x — x’, x — x’). [4.11] 
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Since K depends only on the differences 2g — x — x’ and x — x, 
a function K° of these can be substituted for K(q, 0; x, x’). We now 
have 


Pap) = SÈK — x - x,a - yery layl) ded 
[4.12] 


We shall find, next, the restrictions on K? implied by [4.2] and 
[4.2a]. We have, according to [4.2a], 


Í P(q, p) dp 
J j K°(Qq — x — x’, x — x)? MY (x) rl) dx dx' dp 
= anh | K(2q — 2x, OG)? d = WOOL [4.13] 
from which it follows that 
QnhK°(2q — 2x, 0) = ôlq — x) [4.14] 
or 
K°(q, 0) = + 6(q). [4.142] 
th 


Similarly, [4.2b] gives 


[ran a 
= f | f K? (2q — x — x’, x — Ay WMG (x) Wy (x’) dx de’ dq 
= (2nh)~} | Y (x) rel x eT PA dx dx', [4.15] 
from which it follows that 
free — x = x',x — x’) dq = (Anh)"!, | [4.16] 
or 


[r« x) dq = (nh)7?. [4.16a] 


Quantum-Mechanical Distribution Functions Revisited 257 


3] QUANTUM-MECHANICAL DISTRIBUTION FUNCTIONS 


We can next introduce the requirement that the correspondence 
[4.8] be invariant with respect to space and time reflections. The 
former requirement demands that a simultaneous change of sign 
of all the variables of K leave it invariant. This means that K°® is 
left unchanged if the signs of both of its variables are changed. 
Time-inversion invariance demands that replacement of y by w* 
change p into —p. It follows that 


K(q, —p3 x, x’) = K(q, p; x', x) = K(q, p; x, x")*. [4.16b] 


The last part of [4.16b] follows from the Hermitian nature of K, 
required to give a real P. The first part of [4.16b] renders K° even 
in its second variable—it is therefore even in both of its variables. 
The second part of [4.16b] renders K? real. Therefore, K? must be 
a real and even function of both of its variables. Evidently, these 
conditions do not fully determine K? so that a great deal of 
arbitrariness remains in the definition of P. 

A further condition which can be imposed is that the corres- 
pondence between P and the wave function be symmetric in p and 
q (apart from the sign of i), so that one has, in addition to [4.12], 


also 

Pane) = || Ki, g: k Koto) dt d, [4.17] 
where 

o(k) = (Anh)? ll W(x)e7 2" dy, [4.172] 


This condition will be satisfied if 
[[ KGa 5% E) der 


= f f f | K(p, q; k, k')* (Och) Ip) y (x eiT de dx! dk dk’. 
[4.18] 


258 Part I. Physical Chemistry 


32 EUGENE P. WIGNER 


It follows that 
2nhK(q, p; x, x’) = IEG q; k, k) ži dk dk’, [4.18a] 


or, with [4.11], and since K° is real 


OnhK° (2q — x — x’, x — x'e PEx 
7 IESE — k = k, k — kyla tke keV dk dk. [4.197 


Introducing p = 2p — k — k' and o =k — k' as integration. 
variables and writing č for 2q — x — x and y for x — x’, the 
equation simplifies greatly, and one has 


K°(&,n) = (trh)? ji K? (p, ae” “+ oDI?R dp do. [4.207 


This means that K? belongs to the characteristic value 1 of the 
operator Q with the kernel 


Q(E, n; p, o) = (Anh) ter term, [4.21] 


The operator Q shows great similarity to that of the Fourier trans- 
formation. In particular, its square transforms a function f (p, ©) 
into f(—p, — c), so that, if f is even in p, Ø, 


K? = (1+ Qf [4.22] 


will satisfy [4.20]. One can verify, furthermore, that if f (p, 0) = 
6(p)/nh, then Qf satisfies [4.16a], and conversely, if f satisfies 


[4.16a] so that ro. a) dp = (nh)~1, then Qf(£,0) = (Erh. 


Hence, if f satisfies both conditions, the same will be true for Qf. 

It follows that the condition of the symmetry between q and f, 
as expressed by [4.18], still leaves a great deal of freedom in the 
choice of K. This 1s essentially restricted to one-quarter of a Hilbert 
space. 
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The original choice of the correspondence between P and the 
wave function was? 


P(q, p) = (kn)! f la + lg — ae? dy, [4.23] 


This corresponds to 
K? (q, x) = (An)! 6(9), [4.23a] 


and this is, evidently, the simplest way to satisfy all the preceding 
requirements. However, this is not the only possible choice: 
another, relatively simple selection is 


l in] 
KOE a 2 4.24 
(č, n) mIRE 4 p? [4.24] 


leading to 


_ Ix — x eiP(=— x’)/R az , 
P(q, b) -h EE ee Be Syn W(x’) dx dx 


a ip(x—x’')/h 
= | eae lg + rl + 2!) dx de’. 


~ On? h x? + x’? 

[4.25] 
IH. 
The question now arises whether there is any simple criterion 
which permits a unique coordination of distribution functions to 
wave functions. The only one known to this writer is to demand 
that, in the force-free case, the equation of motion of P be the 
classical one 


TAE Se [4.26] 


This, together with the postulate of invariance [4.11], and the 
conditions (a) and (b) of the first section specify P uniquely. The 
corresponding K will turn out to be given by [4.23a] so that the 
g-p symmetry condition [4.18] is automatically satisfied. 
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One finds from 
ah e ae ey [4.27] 


and [4.12] that 


oP (q, p) = hi [[rer — X% — x’, x pues xyz A 
Ot 2m 


VRR) — Wi (x) h(x’) de de’, [4.28] 


Partial integrations with respect to x and x’ in the first and second 
terms, respectively, give 


ia i] (2x?, pol Kì) ET Up (x) (x!) dx dx, 
ot m h 
[4.29] 


where the lower indices of K? denote differentiations with respect 


to the first and second variables, respectively, 


2K0 0 
Ken EED Ke na FR 


The variables of K? in [4.29], however, remain 2q — x — x’ and 
x — x. On the other hand, 


[4.29a] 


_ É oP (q, p) P IE KURT My (x) Fy (x") dx dx’ [4.30] 
m  ôp m 


gives just the second term of [4.29]. It follows that 


Ke, = ORE n) o [4.31] 
ôč On 

or K(E,n) = f(E) + g(n). Condition [4.16a] now shows that 

g(n) must be independent of n, and it can be assumed to be zero 

since a constant can be absorbed into f. Then, [4.14a] shows that 

f(E) = 6(€)/xh so that we recovered [4.232], the old distribution 

function. | 
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Naturally, in the presence of forces, the distribution function 
does not satisfy the classical equation of motion 


7 
OP _ _ ôP avoP 


l [4.32] 
ôt mög ôq op 


V being the potential. There are many typical quantum phenomena 
which are in conflict with [4.32]. However, the departure from 
[4.32] contains at least the second power of A. The full equation of 
motion was given before.* 

In addition to the correction terms to [4.32], there is a departure 
from classical theory inasmuch as not every P (q, p) is realizable. 
Thus, one has, in addition to the normalization integral 


ren da ap = 1, [4.33] 


the added condition 


f f [P (4, b)]? dq dp = nh, [4.332] 


which is an expression for the uncertainty relation. In the case 
where P represents a mixture rather than a pure state, the left 
side of [4.33a] is actually smaller than 2rA. Equation [4.33a] is a 
special case of a more general one” which gives the transition 
probability between two states, w, and y%,, in terms of the cor- 


responding distribution functions 


fury al = an ÈP DRaa) da dp, 134 


reemphasizing the fact that the distribution functions have to be 
able to assume negative values. Equation [4.34] is a consequence 
of the unitary nature of the kernel (2rk) "!/?K given in [4.23a] for 
the old distribution function. 
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2: 

Positive (more precisely: nonnegative) distribution functions were defined, 
however, by K. Husimi, Proc. Phys. Math. Soc. Japan 22, 264 (1940); Y. Kano, 
J. Math. Phys. 6, 1913 (1965). See also a forthcoming paper by K. E. Cahill and 
R. J. Glauber in the Phys. Rev., and W. A. Smith and E. P. Wigner, Bull. Am. 
Phys. Soc. 14, 59 (1969). See also F. Bopp, Ann. Inst. Henri Poincaré 15, 81 (1956). 
3. 

Ref. 1. 

4. 

Ibid. 

5. 

This formula must have been known rather generally, but no explicit reference 
to it has so far been found. 
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We add to the postulate, that the distribution function give the proper probabilities for the position and momentum 
variables (actually only the former is needed) and that its connection with the wave function which it represents have the 
natural invariances, another one. This is that the integral of the product of two distribution functions be equal, except for 
a universal constant (which turns out to be 27h), to the transition probability between the two states they represent. We 
then show that it follows from these conditions that the distribution function is the one defined earlier by one of us (E.W.). 


Quantum-mechanical distribution functions provide a framework for an exact reformulation of non-relativistic 
quantum mechanics in terms of classical concepts [1,2]. This has led to various discussions of their properties 
(refs. [3—6], and references therein, provide a representative sample of recent papers on this subject) and to their 
application to a variety of problems in physics [7] *?. 

The distribution functions P(g, p) are functions of positional and momentum coordinates q and p, and various 
definitions for P appear in the literature. Thus the question arises as to what conditions are necessary for providing 
a unique definition for P. One of us has previously examined this question and provided one such set of conditions 
[2]. In addition to those listed below — (a) to (e) — the relevant new condition was that, for free particles, the time 
dependence of P becomes the non-relativistic classical one. It is our purpose here to replace this by a perhaps more 
general criterion. 

As in ref. [2], we list as our basic constraints on P: 

(a) that it be a hermitian, that is bilinear, form of the wave function y, and 

(b) that, if integrated over p, it give the proper probabilities for the different values of q, and similarly with 
peq. 

Next, on grounds of simplicity and naturalness, it was required 

(c) that the correspondence between P and the wave function y be Galilei invariant, i.e. invariant with respect 
to displacements and non-relativistic transitions, to moving coordinate systems; 

(d) that P be real for all Y. 

These conditions led to the result that [2, eq. (4.12)] 


P(q,p)= f f KO (2q —x — x',x — x’) iP &—* WA Yaya) dx dx, (1) 


the explicit form of KÌ to be determined by further conditions. Our considerations here will be restricted to one 
dimension but it will be clear that they can be generalized to higher dimensions. 

For our present deliberations, it turns out to be more convenient to replace K? by a more symmetric-looking 
function f, as follows: 


1 Permanent address: Department of Physics, J oseph Henry Laboratory, Princeton University , Princeton, NJ 08540, USA. 
+ See, for example, the various papers cited in ref. [1] of ref. [2] above. Some more recent applications are discussed in refs. [8—10]. 
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PQq,p)= | ffa- x, q- x’) POW yey ye’) dx dx’ . (2) 
As in ref. [2], we next introduce the requirement: 


(e) that the correspondence given by eq. (2) be invariant with respect to space and time reflections. 
This leads to the result that . 


F(a, B) = fE, a)" = fE, a) = f(—a, —B) = f(-B, —a) . (3) 


A new requirement — which will turn out to be the only other condition necessary to determine P uniquely — 
is: 

(£) that the transition probability between two states y and ¢ is given, in terms of the corresponding distribu- 
tion functions, P} and P, say, as follows: 


2 

| fuer ax =x [fP,@,mr,a@,p) da ar, (4) 
where k is an arbitrary constant, the same for all y and ¢, which is determined by the condition that the total inte- 
gral of P be unity. As we will see below, k turns out to be 27H. 

We will now show that we have enough information to determine the function f uniquely. 

Using eqs. (2) and (4) we obtain | 
Jv wow) vw) O@)* du dv=k [ap dq er@NNRfg-x,g - x') ype) 

X elP O-Y WHF — y,q —y') G(V)*O0") dx dx’ dy dy’. (5) 


Carrying out the p integration introduces a 2#hö[(x — x’) + (u — y’)] factor, which prompts us to introduce a new 
variable £ such that x’ =x — &,y’ =y + Ẹ, so that the right side of eq. (5) becomes 


2rñk f dq fa-x,a-xtE)fa 9,9797 E)vR)*ya — E)O0)*O— +E)dx dy dé . 
In this, we replace the variable x by u and & by u — v. Hence, we obtain, instead of eq. (5), 


f Y(u)* Yv) pu) dW)* du dv 


= 2ntik f dq f(a - u,q - V) f -7,4 -Y — u +v)plu)* YOH) +u - v) du du dy . (6) 


Since this is valid for any function Ņ, one can conclude rather easily that the factors of Y(u)* Y (v) in the two inte- 
grals must be equal, i.e. 


ou) o)* = 2mhk [Sea dy fq -u,q-v) fQ-y,q—y—urv)o(y)*o(—y tu—v). (7) 


To proceed further, we replace the integration variable y by y +v to obtain . 


o(u) o(v)* = 2nnk | dq dy fq-u,q-v)fq-v-y,q—u-y) gouty) duty), (8) 


from which we conclude that 


2nnk { dq fq-u,q-v)f@-u-y,q-v-y)=8(y). | 0) 
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Using eq. (3), we have interchanged the variables in the second f of eq. (8). 
We can, finally, regard f as a function of the sum and the difference of its variables, 


fp) = «G(@+ B), a —8). 

In terms of this x, if we substitute w for q — (u + v) and z for v — u, eq. (9) reads 
aahik fx (w,z)K(w —y,z)dw=6(y), 

from which it follows that x is a 6 function of its first variable. Hence 

F(a, B) = (a — B)5(a+B-—c). 

It follows from eq. (3), f(a, p) = f(—B, —a), that 

g(a -MölatB-c)=s(-Prta)öl-a-Pß-c), 


i.e. that c = 0. But the function g remains to be determined. 
From requirement (b), we have 


JP@.p) dp = wa. 


Hence, from eqs. (2), (12) and (13) we obtain g(q) = (th)! . Thus 
f(a, B) = (nh) 18a +B), 
which, when substituted in eq. (2), leads to the result 


Pha, p) = (A)! f [WEYER — x — x!) PM dx ax! 


= (ah)-1 f WE Yq - x) iP Ox- dx , 
Thus, if we set x =q + y, we finally obtain 
P(q,p)= (an)-! f ya +y)*Wq - y)e?iP? Ray. 


Using this result in eq. (4) leads to the result that k = 2nh*?. 


25 May 1981 


(10) 


(11) 


(12) 


(12a) 


(13) 


(14) 


(15) 


(16) 


In summary, we have shown that, if in addition to the postulates (a) to (e), already discussed in ref. [2], we add 
the postulate that the integral of the product of two distribution functions gives, up to a universal constant (which 
turned out to be 27), the transition probability between the corresponding two states, we are led uniquely to the 


distribution function given by eq. (16) — which is the original distribution function given in ref. [1]. 


One of us (R.F. O’C) was partially supported by the Department of Energy under Contract No. DE-AS05- 


79ER10459. 


+2 Hence eq. (4) is equivalent to eq. (4.34) of ref. [2], except that the latter equation — and also eq. (4.33) of ref. [2] — has 


the 2rA factor in the wrong side of the equation, as is evident from dimensionality considerations. 
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We consider the distribution function obtained by smoothing the original distribution function, defined in an earlier 
publication, with a ground-state harmonic oscillator wave function. We derive its time dependence and show that, in par- 
` ticular, the field-free result does not correspond to the classical result. We point out that the non-negative property of the 
smoothed function follows immediately from the fact that the integral of the product of two of the original distribution 
functions is equal, except for a factor 2rÄi, to the transition probability between the two states they represent. 


1. Definition of a non-negative (“smoothed”) quantum mechanical distribution function. In a recent publica- 
tion [1], we examined the conditions which are necessary for providing a unique definition for a quantum-mech- 
anical distribution function P(q, p), where q and p are positional and momentum coordinates. The conditions we, 
listed lead uniquely to the result that, for every normalized state vector W, 


P(q,p)= (ah)! f vq ty)" - yeh dy , (1) 


which is the original distribution function given in ref. [2]. Some of the properties of P are [1—3]: 

(a) It is a hermitian, that is bilinear, form of the wave-function y. Hence it is real for all q and p. The hermitian 
operator is, of course a function of q and p. 

(b) If integrated over p, it gives the proper probabilities for the different values of q, and similarly with p + q. 

(c) The transition probability between two states y and ¢ is given, in terms of the corresponding distribution 
functions, Py and P, say, as follows: 


| fue) ax] = 2mh ff P,(q.p)Ps(a.P) dq dp. (2) 


As pointed out in ref. [2] and proven in ref. [3], conditions (a) and (b) are incompatible with the requirement 
that the distribution function be everywhere nonnegative. In fact it is clear from condition (c) that the distribution 
function given by eq. (1) has to be able to assume negative values. 

Starting with Husimi [4], many authors obtained non-negative distribution functions by dropping condition (a). 
In most cases this is essentially achieved, for all points (q, p), by smoothing P(q’, p') with a density function D(q', 
p') and integrating over all p' and q’. 

A natural and popular choice of the density function D is a gaussian distribution [4—8], recently considered 
anew by Cartwright [9]. 

Our considerations here will be restricted to one dimension but it will be clear that they can be generalized to 
higher dimensions. Consider a linear harmonic oscillator, centered at the point q and moving with an average mo- 
mentum p. Then, as is well known, the ground-state wave-function, at the position q and average momentum p 


1 Permanent address: Department of Physics, Joseph Henry Laboratory, Princeton University, Princeton, NJ 08540, USA. 
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becomes, as function of q’, 
Vg, pla’, a) = (ra) -4 ea a /Daeipa'h , (3) 
where (Aq')? = a/2. Using this expression for Y p in eq. (1), it may easily be verified that the corresponding dis- 
tribution function, Py „(q’, p’, a) say, is 

’ ' 2 tp}? 
Py p(q’, p’, a) = (ahi)! e-@-Dlae-ap-pVM?. (4) 
This function has the property that 
(Ag)? =a/2 and (4Ag')(Ap')=ñ/2. (4a) 


This is the density function which, following Husimi [4], we will use to smooth P(q’, p'). Thus, the resultant smooth- 
ed distribution function, P (4, p, œ) say, is simply 


Pxa,p,a)= |f Pa, p')P pap a) dp’ da’. (5) 


Carrying out an explicit calculation, Cartwright [9] showed that P,(q, p, a) is everywhere non-negative. However, it 
is clear that since the rhs of eq. (5) is a particular case of the rhs of eq. (2), and since the Ihs of eq. (2) is non-nega- 
tive it follows immediately that 


Pq, p,a) 20, (6) 


for all q and p. 
Before turning to a consideration of other properties of P,, not heretofore considered in the literature, it is con- 
venient to use eqs. (4) and (5) to write explicitly 


P (a, p,a) = (ah)! ff P(g’, pje- Da e-atp PM? aq! ap’ , (7) 


where the P without the index is the old distribution function (1). It may be of some interest to remark here al- 
ready that P, is an “entire function” of q and p, i.e. that the range of these variables may be extended to the whole 
complex plane without encountering any irregularities. First of all, by simple differentiation, it is easy to verify 
that 


aaP, /ða = (a/4)32P, /dq2 — (A? /4a)d2P, jap? . (8) 


2. Time dependence of the smoothed distribution function. We next consider the equation of motion of P,. 
The time dependence of P, may be decomposed into two parts, 


OP, /Ot = 0,P,/dt + 0,,P,/dt , (9) 
the first part resulting from the (i/2m)d2/dq2, the second from the potential energy V/ih part of the expression 
for dy/ot. 


It has already been shown [2] that the time dependence of P(q', p') corresponds to the classical result in the 
field-free case, i.e. 


8, P(q', P’)/at = —(p'/m) aP(q', p')/2q" , (10) 
and in the presence of a potential we have the extra contribution 

a,P(q', P Yt = faj Pap +I), (11) 
where 
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J(q’, i) = G/A?) f [Va +y) Va - y)] e- Wil ay (12) 


is the probability of a jump of the momentum by an amount j if the positional coordinate is q’. 
Applying the results to eq. (7), we have, first of all, that 
I P.(q, P, &) 


= p' dP(q', p’) la’? en! fi 2 fn? ' ' 
= —(mh) EARE (q'—q) la e-a(p'-p) } dq’ dp 
or . m aq 


(13) 
CO a P pg',p Ve z {670-0 e-atp- py? da! dp‘, 
the right side having been rom by partial integration. But since the 0/dq', as applied to the expression in the 


curly bracket can be replaced by —d/0q, we also have 

OxPs(Q,P,&) _ 
ot 

The classical expression for 0, P,(q, p, «)/dt would be 


ðP (q, p, &) 
-f l ðq = (mh)! : a SS mPa’ p'){e-(1' -0 la e—a(p’- p? h?) dg’ dp’ . (14a) 


—(nñ)-! — 2 È Sfera, p'){e~ @'- 9)" la e-atp'-p)?/n?} dq' dp’ . (14) 


The difference between the two expressions (14) and (14a) is 


(nh)-! — g wer p P(q',p', a) {e-@- -4% a e—a(p’ — Pp)? M?) dg! dp' 


(15) 
2 
= (mh) — -@- 9)? la e-atp'-p)? m? h o? 
an- A SSPE ate a } dq’ dp'= 5 g ap PEPA 
Hence we obtain 
O,P.(q,P,0) 4 fi2 2) 2 
or --2( er op} aq s(4; P, a) . (16) 


It is thus clear that, in the field-free case, the time dependence of P,(q, p, a), in contrast to that of P(q, p), is not 
given by the classical expression but contains a correction term of order #2. But this is not a quantum effect: the 
same expression would appear in the time derivative of the classical distribution function if this were “smoothed” 
as in (7). 

We turn now to a consideration of 


ð P.(q, P, & 0,P(q', p 
== 37 > ah)! SEE a ~4)" Ja e-alp'-P)’M?} da! dp' . (17) 


Thus, utilizing eqs. (11) and (12) in eq. (17), we obtain 


0,Ps(a,P, &) i 
Fe = Vera dp’ dq’ dj dy P(q’,p' +j)[Y(@’ +y) - Va’ - y) 
ot (th) (nh?) SSff (18) 
x {e-@'-4)?/a e-atp'- PM? @—2ijyh} =_ 2 Im7 
n2h3 ’ 


where 
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I= SITT dp' dq' dj dy P(q';p' +7) V@' +y) {e70 -0a e-a(p'-p? A? e-2UyM] | (19) 
Replacing y by a new variable z = y + q' and p' by p” =p’ + j this becomes 
1= [JSS Pa',p")e-@rieih- Pl e-atpti- PM? Via) e-ai?m?+2ija-2M ap" aq' dj dz (20) 


Because of eq. (7) and the possibility of extending P, also to complex values of q and p, this can be written 
also as 


l=nh Se dz P.(q t+ iaj/h ,p +j, a) V(z) e-aj A? +2ij(g-2)M | (21) 


As was observed already before, P, is an entire function so that it remains uniquely defined in spite of the complex 
nature of one of its arguments. It now follows from eqs. (9), (16), (18) and (21) that 


OP (4 P, &) IKP (4P, 0) app) _ 1( E2) dP (q, P, &) 


or ðt ðt m 2a dp ðq (22) 


-iG ff (Pq - ij, p + hj, aje- 0-2 — Pq + ioj, p +hj, aje? 0-2] Yz)e=w” dj dz , 


the integration variable j of (21) having been replaced by hj. 

Eq. (22) is, probably, the shortest expression for the time derivative of P,. It may be observed that the time 
derivative of the smoothed distribution is not very simple even in classical theory — not even if we restrict ourselves, 
as was done in all the preceding discussion, to the non-relativistic limit. 

The extension of P, into the complex plane in (22) can be made unnecessary by expanding the P, under the 
integral sign into a power series of iaj. The exponentials of ij (q — z) are then replaced, for the successive q deriva- 
tives of P,, by sin 2j (q — z), cos 2j (q - z), —sin 27 (q — z), —cos 2j (q — z), again sin 2j(q — z), and so on. For the 
factor accompanying the nth derivative of P, this can be written as sin[2j(q — z) + + nn]. Hence one obtains 


LACH ij n2 3.) an. 


ar “mAP S20 ap) ðq 


22a) 
1s 1 Pa pta) FAR ( 
a aldi - + hmm] V(z)e-e/ dj dz. 


There are two, apparently different but mathematically identical, expressions for the time derivative of the old 
distribution function P of (1): the one given by eq. (11) of ref. [2] (eqs. (10) and (11) of the present article), the 
other by eq. (8) of the same article. The latter expression is a power series in #2 and shows the analogy to the 
classical expression for dP/dt more clearly. The next section will derive the analogue of this for 97, /ör, using the 
old expression (8) of ref. [2] for dP/dt. 


3. Other expression for the time derivative of P,. The alternate expression for dP, /dt is also a sum of the two 
parts, as given by (9). Since the first part is again given by (16), only the second part, i.e. 0,,P,/0z will be recalculat- 
ed. According to eq. (8) of ref. [2] 


BPP) =), +(e) ^V (q) OXP(q, p) 


2i 
The summation over X is to be extended over all odd positive integers. We have, therefore, 


(23) 
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vPs(9,P,%) 4 In +E A-1 
ðt mh NNi 
(24) 


d\V(q') P,P) (q'—q)*/a e—a( 2 /h? 
oa ies -a(p'- p) /h 
X SJ 592 ap {e } dq‘ dp’. 


By partial integrations with respect to p’ and replacement of the p’ derivatives of the last factor by its negative p 
derivatives one obtains 


-l 
Be 42 y = sf DVA) pi‘, p'){e-@'— Wl eat’ PM?) aq' dp (25) 
ðt mh a A! \2i Ip‘ ðq `^ 
It is natural now to expand the derivatives of V into a power series of q' — q, 
AV), 1 RVG) 
ðq'^ u u! qtu 


@-q#. = 
We then use the identity +! 


2 Wu! &u—K du-2xe-x?/a 
ke-Xx”/a at } u! > er 2 Ae RANG 
we 2H KC — 2K)! gyre” (27) 


in which the summation over x is to be extended from 0 until u — 2k becomes negative. It may be observed that if 
u is even, the last term of (27) contains the function ie Ja itself, if uis odd the last term contains the first deriv- 
ative of this. Introducing now (27) into (25) with x =q’ — q and replacing the q’ derivative of the last factor by 
the negative q derivative [which introduces a factor (—)#], one obtains finally 


ð P (q, P, a (h/2iy—lae—* — adt#V(q) a` AA SS Pq’ p'ye-@'-9)" la e—a(p'—p)*/n? dp’ dq’ " 
D mhp A2eKI(u— 2K)! aqgatH ap aqu-2e TI T? | 

Hence 

OP (q,p, pr na ee USE SEN u ER ei en (29) 
or wer OP} ðq ° aur 2AtH-l IK NC — 2K)! BQATH apă ägu-2« ` ERR 


It may be good to recall that A assumes all odd — but only odd — values from 1 to infinity, x all integer values from 
O as long as u — 2x remains non-negative, u all integer values from O to infinity. 

The second part of (29) is surely not simple, neither is the corresponding expression in the classical limit (A = 0). 
Eq. (29) was derived because it is easier to derive from the approximate expressions for OP, /ðt than it is from (22). 
If the “smoothing” over the coordinate is very narrow, i.e. if œ is assumed to be very small, the second part of OP, / 
ðt naturally goes over into the expression for dP/dt as given by eq. (23). If A is also assumed to be small, it goes 
over into the classical expression. But many other approximate expressions for ðP, /dt can be derived from (29), 
as were also from (23). 

The preceding calculations and the resulting equations are apparently restricted to the case of a single dimension. 
However, it is not difficult to generalise them for the general case of several dimensions. As to the more simple 
expression (22) for dP, /dt, the variables q, p, j and z have to be treated as vectors of as many dimensions as there 
are dimensions in coordinate (or momentum) space. The exponents j (q — Z) are to be replaced by the scalar prod- 
ucts of the vectors j and q — z, the j? byj*j. In the case of (29) the situation is a bit more complex: the indices 
K, A, u must be replaced by as many sets of indices K,,, A,,, Hp as there are space (or momentum) dimensions and 
the summations extended over all these indices. The restrictions are then that the sum of all the X must be odd and 
that all the u, — 2x, must be non-negative, as are also all K}, Un, An- 


+1 We could not find this equation in the literature, even though it must have been known. It is not difficult to verify it — for 
a = 1 the factors of exp(—x?) on the right side are Hermite polynomials. 
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Abstract: 

This is the first part of what will be a two-part review of distribution functions in physics. Here we deal with fundamentals and the second part 
will deal with applications. We discuss in detail the properties of the distribution function defined earlier by one of us (EPW) and we derive some 
new results. Next, we treat various other distribution functions. Among the latter we emphasize the so-called P distribution, as well as the 
generalized P distribution, because of their importance in quantum optics. 


1. Introduction 


It is well known that the uncertainty principle makes the concept of phase space in quantum 
mechanics problematic. Because a particle cannot simultaneously have a well defined position and 
momentum, one cannot define a probability that a particle has a position g and a momentum p, i.e. one 
cannot define a true phase space probability distribution for a quantum mechanical particle. Nonethe- 
less, functions which bear some resemblance to phase space distribution functions, “quasiprobability 
distribution functions’, have proven to be of great use in the study of quantum mechanical systems. 
They are useful not only as calculational tools but can also provide insights into the connections 
between classical and quantum mechanics. 

The reason for this latter point is that quasiprobability distributions allow one to express quantum 
mechanical averages in a form which is very similar to that for classical averages. As a specific example 
let us consider a particle in one dimension with its position denoted by q and its momentum by p. 
Classically, the particle is described by a phase space distribution Pcı(g, p). The average of a function of 
the position and momentum A(g, p) can then be expressed as 


(A)a= | dq | ap Ala, p) Paq, p). (1.1) 


The integrations in this equation are from —% to +. This will be the case with all integrations in this 
paper unless otherwise indicated. A quantum mechanical particle is described by a density matrix 6 (we 
will designate all operators by a °) and the average of a function of the position and momentum 
operators, Ä(4, p) is 


(A) quant = Tr(A@) (1.2) 


(Tr O means the trace of the operator Q). It must be admitted that, given a classical expression A(q, p), 
the corresponding self-adjoint operator A is not uniquely defined -and it is not quite clear what the 
purpose of such a definition is. The use of a quasiprobability distribution, Po(g, p), however, does give 
such a definition by expressing the quantum mechanical average as 


(A) quant = f dq | dp A(q, p) Pa(q, p) (1.3) 


where the function A(q, p) can be derived from the operator A(d, f) by a well defined correspondence 
rule. This allows one to cast quantum mechanical results into a form in which they resemble classical 
ones. 

The first of these quasiprobability distributions was introduced by Wigner [1932a] to study quantum 
corrections to classical statistical mechanics. This particular distribution has come to be known as the 
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Wigner distribution, and we will designate it as Pw. This is, and was meant to be, a reformulation of 
Schrödinger’s quantum mechanics which describes states by functions in configuration space. It is 
non-relativistic in nature because it is not invariant under the Lorentz group; also, configuration space 
quantum mechanics for more than one particle would be difficult to formulate relativistically. However, 
it has found many applications primarily in statistical mechanics but also in areas such as quantum 
chemistry and quantum optics. In the case where Pg in eq. (1.3) is chosen to be Pw, then the 
correspondence between A(q, p) and A is that proposed by Weyl [1927], as was first demonstrated by 
Moyal [1949]. Quantum optics has given rise to a number of quasiprobability distributions, the most 
well-known being the P representation of Glauber [1963a] and Sudarshan [1963], which have also found 
extensive use. As far as the description of the electromagnetic field is concerned, these do exhibit 
(special) relativistic invariance. Other distribution functions have also been proposed (Husimi [1940]; 
Margenau and Hill [1961]; Cohen [1966]) but have found more limited use, although, more recently, 
extensive use has been made of the-generalized P representations by Drummond, Gardiner and Walls 
[1980, 1981]. In this paper we will discuss the basic formalism of these quasiprobability distributions and 
illustrate them with a few simple examples. We will defer any detailed consideration of applications to a 
later paper. 

We now proceed to the basic problem: how do we go about constructing a quantum mechanical 
analogue of a phase space density? Let us again consider, for simplicity, a one particle system in one 
dimension which is described by a density matrix p. In this paper we will work, for simplicity, in one 
dimension; the generalization to higher dimensions will be given in a few cases but is in most 
circumstances obvious. It is possible to express the position and momentum distributions of the particle 
as 


Pros(q) = Tr(65(q ~ 4)) (1.4a) 


Pmom(P) = Tr(6 6(p - P)) C2) 


where ô(q - 9) is the operator which transforms |q’) as follows: 


5(q- â) lq) = |q) (alq’) = ölq - 9’) lq’) (1.5) 


and similarly for 6(p - p). We introduce the function p(q', q”) defined by 
p(q', 4") = (q'l6la") = > wa yala’) aa’) (1.6) 
A 
where w, is the probability of the system being in the state y, and the {y,} form a complete set. Then 


P2o0s(9) = p(q, q) (1.7a) 
and 


Pmom(p) = (2ah)* | dx | dx’ p(x, x’) expfip(x'- x)/h}. (1.7b) 


t We use this designation here and throughout the paper despite the strenuous objections of one of us since the majority of us feel we should 
adhere to what is now common nomenclature. 
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To show that this corresponds to the usual definition we will examine P,..(q). We have that, in the 
Dirac bracket notation, 


Prola) = THB 5(9- 4) = | dq’ (a1äötg- Gla’) 


= | dq’ 8¢q- 0) q'éla’) = (aléla) (18) 


which is a more conventional expression for the position density. A first guess for some kind of a phase 
space density might then be 


P,(q, p) = Tr 6(q - â) 8(p — p)). (1.9) 


On the other hand, we might choose instead 


P2(q, p) = Tr(6 6(p - p) (4 - â)) . (1.10) 


But these expressions are not equal and although either of them, or a combination of both, could be 
used to evaluate expectation values of functions of G and f (provided the operators are ordered 
properly, the ordering for P, being different than that for P2), they do not possess what we regard as 
desirable properties (see section 2). In fact, they are, in general, not real. 

The association of distribution functions with operator ordering rules (or, equivalently, the asso- 
ciation of operators with classical expressions) is one which will recur throughout this paper. Each of the 
distribution functions which we will discuss can be used to evaluate expectation values of products of 
operators ordered according to a certain rule. We will consider distribution functions which can be used 
to compute expectation values of products of the position and momentum operators g and p, and also 
distribution functions which can be used to compute expectation values of products of the creation and 
annihilation operators, ä* and â. The latter are useful in problems involving electromagnetic fields. 
Because the creation and annihilation operators are simply related to g and # there is a relation between 
these two types of distribution functions. The Wigner distribution, for example, has proved useful in 
both the â, a” and f, Â contexts. The basic criterion for the choice of a distribution function for a 
particular problem is convenience. 

In the next two sections we will continue to examine distribution functions expressed in terms of both 
the position and momentum variables. The Wigner function, Pw, will be discussed first in section 2 for 
not only was it the first quantum mechanical phase space distribution to be considered, but also it 
satisfies a number of properties which make it quite useful in applications. First of all, we will discuss its 
properties and then show that Wigner’s distribution function gives the same expectation value for every 
function of p and q as does the corresponding operator, as proposed by Weyl [1927], for the density 
matrix which describes the same state to which the distribution function corresponds. As was mentioned 
before, this was first observed by Moyal [1949]. Next we derive an equation, in many different forms, 
for the time dependence of P,. Finally, we apply the formalism we have developed to the calculation of 
P,, for the eigenstates of the harmonic oscillator and also for the case of a canonical ensemble of 
harmonic oscillators at temperature T. 

In section 3 we discuss distribution functions other than P, which correspond to operator ordering 
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schemes different from that of Weyl-Wigner. Then in section 4 we treat distribution functions in terms 
of creation and annihilation operators, with emphasis on normal, symmetric and anti-normal ordering. 
In particular, we emphasize the normal ordering from which arises the well-known P distribution of 
quantum optics. We also discuss the generalized P representations. Finally, in section 5 we present our 
conclusions. 

Applications will be treated in a future paper but we would be remiss not to mention the recent 
extensive review of quantum collision theory using phase space distributions (Carruthers and Zachari- 
asan [1983]) and the work on relativistic kinetic theory-in addition to extensive discussions on the 
Wigner-Weyl correspondence - by the Amsterdam group (Suttorp and de Groot [1970]; Suttorp [1972]; 
de Groot [1974]; de Groot, van Leeuwen and van Weert [1980]). Also, a brief overview of some 
applications is presented in O’Connell [1983a,b]. 


2. Wigner distribution 
2.1. Properties 


In a 1932 paper (Wigner [1932a]) the distribution 


8 


1 ’ i 
Pag P)= =z | dy 4- ylöla+ y) e” (2.1) 


8 


was proposed to represent a system in a mixed state represented by a density matrix ĵ. In the case of a 
pure state, y, it follows from eq. (1.6) that p(g’, g”) = W(q’) w*(q") and hence 


P.(q.p)=— | dyw"(q+y)u(q~ yer”. (2.22) 


8 


The latter result refers to one dimension. In the case of more than one dimension, the A must be 
replaced by (7h), where n is the number of the variables of y (or the number of variables of the rows 
or columns of 5) and q, y and p are n-dimensional vectors, with py the scalar product of the two. The 
integration is then over all components of y. Explicitly, eq. (2.2a) generalizes to 


eo 


P«(q1 - - - Qn Pis - - - Pn) = (ahy” | BR | dyi- -dyn W*(qit Yis... Gn + Yn) 


00 


X P(g- Yis - - -Gn — Yn) eXP[2i(pı yi +*+ Payn)/A]. (2.2b) 


It was mentioned that this choice for a distribution function was by no means unique and that this 
particular choice was made because it seemed to be the simplest of those for which each Galilei 
transformation corresponds to the same Galilei transformation of the quantum mechanical wave 
functions. In later work Wigner [1979] returned to this issue by considering properties which one would 
want such a distribution to satisfy. He then showed that the distribution given by eq. (2.1) was the only 
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one which satisfied these properties. A subsequent paper by O’Connell and Wigner [1981a] considered 
a somewhat different list of properties and showed that these, too, led to the expression in eq. (2.1). 
The properties for a distribution function, P(q, p), which were considered of special interest, for the 
case of a pure state (generalization to the case of a mixed state is straightforward), are as follows 
(O’Connell and Wigner [1981a}): 
(i) P(q, p) should be a Hermitean form of the state vector #/(q), i.e. P is given by 


P(q, p)= (v|M(q, pW) (2.3) 


where M(q, p) is a self-adjoint operator depending on p and q. Therefore, P(g, p) is real. 
(ii) 


f ap Pa, p)= a? = (alle) 0.4) 
f da Pia p) = (plölp) 2.5) 
| dq | dp P(g, p)=Tr(é)=1. (2.6) 


(iii) P(g, p) should be Galilei invariant, i.e. if w(q)> ¥(q+ a) then P(q, P)> P(q+a,p) and if 


W(q)— explip’g/h) ¥(q) then P(q, p)> P(q, p- p’). 
(iv) P(q, p) should be invariant with respect to space and time reflections, i.e. if Y(q)> #(—q) then 


P(q, p)> P(-q,-p) and if Y(q)> Y*(q) then P(q, p)> P(q, -p). 
It should be admitted, however, that neither of these transformations is relativistic and also that they 


do not yet involve the spin variable. 
(v) In the force-free case the equation of motion is the classical one 


OF cee (2.7) 


(vi) If P,(q,p) and P,(q,p) are the distributions corresponding to the states y(q) and ¢(q) 
respectively then 


|| agy”) gta) = enn) | da | dp Pola, p) Pola). 2.8) 
Property (vi) has two interesting consequences. If we set d(q) = y(q) we get 
pand 
f ag | ap Paa = 3% 2.9) 


and, in the case of a mixed state, the right-hand side of eq. (2.9) is multiplied by 2, wi where the wx are 
the probabilities for the different states (the characteristic values of 6). This implies that P,(q, p) is not 
too highly peaked and rules out such distributions as P,(q, p)= 6(q— 4) 6(p— p) which would be 
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possible classically. We can also choose @ and yw so that they are orthogonal. We then have that 


| dq | dp Ps (q, p) Ps(q,p) = 9 (2.10) 


which implies that P(g, p) cannot be everywhere positive. This conclusion is actually rather general. 
Wigner [1979] has shown that any distribution function as long as it satisfies properties (i) and (ii) 
assumes also negative values for some p and q. 


(vii) 
f aq | dp Ala. P) Ba, p) = Ort) TAB) 11) 


where A(q, p) is the classical function corresponding to the quantum operator Á, and is given, according 
to Wigner’s prescription, by 


Alg, p)= | dz e”"* (q-z]Alg+ 32) (2.12) 


so that f f dg dp A(q, p) = 27h Tr(A). A similar relation exists between B(g, p) and B. 

The proof of eq. (2.11) will be shown below to follow as a particular case of a more general relation 
(eq. (2.23)) for F(q, p), in terms of A(q, p) and B(q, p), where F = AB. From eq. (2.12), it is at once 
evident that the phase space description A(q,p) of the operator A is real if A is self-adjoint 
(Hermitean) and is imaginary if A is skew Hermitean. Since in neither case does A(q, p) vanish, it is 
evident that if it is real, its operator A is self-adjoint, if it is imaginary A is skew symmetric. It is also 
evident that the phase space description of the Hermitean adjoint Ä* of A is the complex conjugate of 
the similar description of A. Similarly, if the phase space descriptions of two operators are complex 
conjugates of each other, then the operators are Hermitean adjoints of each other. 

By comparison of eqs. (2.1) and (2.12), it is clear that P(q, p), derived from the density matrix, is 
(2rh)"' times the phase space operator which corresponds to the same matrix. Also, for A = f and B 
equal to the unit matrix, eq. (2.6) immediately follows from eq. (2.11). Furthermore, for Ê = 6, eq. 
(2.11) reduces to 


f aa | dp Ata, p) Pola p) = THE AG, P), 213) 


which is equivalent to eqs. (1.2) and (1.3). This result was originally obtained (Wigner [1932]) for the 
special case of A being the sum of a function of f only and a function of q only but Moyal [1949] 
showed it was actually true in the case where A is any function of ĝ and ), if AG, p) is the Weyl 
operator (discussed below in section 2.2) for A(q, p). In addition, if we take A = B = ĝ in eq. (2.11) and 
use the fact that, if 6 represents a pure state, Tr(6) = Tr 6 = 1, we obtain eq. (2.9) again. 

(viii) If we define the Fourier transform of the wave function 


$CP) = nny | agyi, (2.14) 
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then eq. (2.2a) can be re-written in the form 


P(g, p)= (mh) | dp’ 6*(p+ P) d(p- perme, (2.15) 


exhibiting the basic symmetry under the interchange q © p. 
It may be worth observing also that the contraction of the distribution function from n to n-1 
variables 


f | Pian- anon qn; Pis +++ Pn-1, Pn) dgn dpn 


= (rh [++ f pq yi- -gn-1— Yazi Ga = Yni Gat Yas + +- Ani + Yazi» Ga + Ya) 
x expl2i(pı yi +'** + Pa-1Yn-1 + Pn Yn hy dy1* ` * dYn-1 dyn dgn dpn 
= (m |e f olaa- Yi- dant Yaris Gn = Yni DE Yis- Geet + Ynis Ga + Ya) 
x exp[2i(piyit "+ Pa-1Yn-1)/A] (Yn) dyi* ++ dYn-1 dyn dgn 
= (my [+++ LPi- Yi- -+ dnt Yan ts Gn Qa + Vise Gna + Yazar Go) A] 
x exp[2i(pı yıt + Pnr-1Yn-1)/A)] dy1 + dYn-1 (2.16) 


gives the distribution function which corresponds to the properly contracted p (in square brackets). 
Actually, this is true also for the other distribution functions which will be considered in section 3. 

Wigner in his 1971 paper also showed that properties (i)-(v) determined the distribution function 
uniquely. O’Connell and Wigner [1981a] showed that properties (i}-(iv) and (vi) also accomplish this. In 
both cases the distribution function was that given by eq. (2.1). 

Finally, we draw attention to two restrictions on the distribution function discussed above. First of 
all, as already mentioned, it is non-relativistic. Secondly, not all functions P(q, p) are allowed, as we will 
now demonstrate by turning to the question of the admissability of P and asking what condition is 
necessary so that P implies the existence of the density function p, the expectation values of which are, 
naturally, positive or zero. Our starting-point is eq. (2.2a) from which it follows that 


f ap e= Pig, p)= plg- y,q+ y). 17) 
Hence, changing variables to u = q + y and v = q — y, we obtain 

p(v, u) = | dp e "WW PÈ(u + v), p). (2.18) 
We remark that since p on the right-side of eq. (2.18) is a dummy variable it is clear that it could be 


replaced by q. 
Now the condition for P(q, p) to be a permissible distribution function is that the corresponding 
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density matrix be positive definite, i.e. 


| dx | dx’ y*(x) p(x, x) ya’) > 0 (2.19) 
for all y. Using eq. (2.18) and eq. (2.19a), it follows that the condition that P(q, p) be permissible is that 
[da [PP P'a p= (2.19b) 


for any P'(q, p) which corresponds to a pure state. This is evident already from eq. (2.8). It also follows 
from eq. (2.11) and the fact that Tr(66’)=0. Eq. (2.19b) holds, of course, for any P’ which is itself 
permissible but the permissibility of P follows already if it is valid for all P’ which correspond to a pure 
state. 

Eight properties of the distribution function were discussed above, eqs. (2.3) to (2.16), with the 
emphasis on the use of this function to form another description of a quantum mechanical state, i.e. be 
a substitute for the density matrix. Just as eq. (2.1) permits one to give a phase space formulation to the 
density matrix f, we emphasize that eq. (2.12) permits one also to give a phase space formulation to any 
matrix — Or operator — and it may be useful to consider the properties of eq. (2.12). 

In particular, we wish to derive an expression for the function F(q, p) which corresponds to the 
product F = AB of two operators A and B to which the q, p functions A(q, p) and B(q, p) correspond. 
We assume that the operators A and B are matrices, the rows and columns of which can be 
characterized by a single variable, but the generalization to a many-dimensional configuration space is 
obvious. We can write, therefore 


F(x, x")= | A(x, x’) B(x’, x") dx’. (2.20) 
Analogous to eqs. (2.17) and (2.18), eq. (2.20) can be written as (taking A = 1 for this derivation) 
| dp, Fae +x, pee any | | | dx’ dp’ dp" Ada + x’), pero 
x BO(x' + x"), perro), (2.21) 
Substituting x = q+ q’, x" = q — q', multiplying with e~"7” and integrating over q’ one obtains 
F(q, p)=2(27)* | | | | dq’ dx’ dp' dp" A(a(q + q' + x’), p') BG(q - q' + x’), p”) 
x expi-ig'(2p = p'- p")~ i(p"— p') (q- x}. (2.22) 
Introducing finally new variables y = 3(q + x’), y'=3q', p' = p —p', p"=p + p', one obtains 
F(qp)= 160m)? | | | dy dy’ dp dp’ A(y + y',p- p) By yp +p) 
x exp{-4i y'(p — p)— 4ip'(q— y)} 


= sem)? || fay dy’ dp do’ Alg+y+y’,p+p-p')B(q+y-y,p+p+p')e'®"”. 
(2.23) 
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This expression for F(g, p), which is a new result, also shows the similarity of the roles of p and q in 
Hamiltonian mechanics. In the next subsection, another expression (eq. (2.59)) for F(q, p) will be 
presented. 

If we integrate F(q, p) in eq. (2.23) over q and p, we obtain 


| | Flarydqap=16eny? | | | | dy ay dp do’ Aly +y p-p) BO -y'o +) 
x exp{4iy’p+ 4ip’y} (427/16) 5(y’) ö(p'). (2.24) 


Hence 


f | Fa.p)da ap = | | AG. p) Ba. p) da ap. 2.25) 


Since the left-hand side of this equation is the same as (27h) Tr(F), it is clear that eq. (2.25) is the same 
as eq. (2.11). In the case of n dimensions, the nth power of (4/77) appears in the expression 
corresponding to eq. (2.23). f j 

Eq. (2.23) provides also a means to ascertain, in terms of the phase space descriptions of A and B, 
whether these two operators commute. Naturally, the condition for the commutative nature is 


[fle dy’ dp dp' A(y + y',p—p') Bly — y',p + p') exp{-4i y'(p - p)- 4i p'(q - y} 


u J J l | dy dy' dp dp’ B(y + y',p - p') Aly- y’,p + p') expt-4i y'(p— p)- 4ip'(q- y}. 
(2.26) 


Since this is valid for all p and q, the integration over the variables which are their factors in the 
exponent (i.e. y’ and p’) can be omitted. This gives as condition for the commutability of A and B (we 


replace y, y’ by q, q’ and p, p' by p, p’): 
| Jo dqa [Alg+q',p- p')B(q-q’,p+p')- Alg-g',p+p')B(a+gq',p-p') 


xexp{4i(q'p + p'q}=0, (2.27) 


a somewhat unexpected expression. 

The last quantum mechanical relation that will be translated into phase space language is the 
equation Ag = Ap specifying that the wave functions of which ô consists are characteristic functions 
(eigenfunctions) of A with the characteristic value A. Whether p contains only one or more such 
characteristic functions depends whether or not its phase space representation, P,,, satisfies eq. (2.9), i.e. 
whether its square integral is equal to or smaller than (27h) ”’. 

The Ap = Af relation, with p represented by Pw, reads, according to eq. (2.23), in phase space 
language: 
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(ain) | | | | ay ay’ dp dp’ A(y +y’ p-p) Puly- y’, p + p')explái y'(p- p)+ 4ip'(y-4)} 
= à P(g, p). (2.28) 


In order to simplify this, one can multiply with exp{4i(q'p + p'q)} and integrate over p and q to obtain, 
substituting also q and p for the integration variables y and p, 


| | dq dp A(q + q', p- p') Pola- q’,p + p’) expiäi(g’p + p'a} 
=à | dq dp Pulg, p)expldi(a’p + Pa). (2.28) 


Both eqs. (2.27) and (2.29) are a good deal more complicated than the quantum mechanical equations 
for which they substitute. It is questionable whether they are really useful. We thought that they should 
be derived in spite of this because the final form is considerably simpler than the original one and 
because they clearly demonstrate the essential phase space equivalence of q and p. It may be worth 
remarking finally that in the case of several dimensions all variables should be considered as vectors, and 
products like g’p or p’q should be replaced by scalar products of these vectors. 


2.2. Associated operator ordering 


We will now discuss the connection between a classical function of q and p and a quantum 
mechanical operator which is supposed to correspond to it. The result of the measurement of a quantum 
mechanical operator is well defined: it is supposed to transfer the state of the system on which the 
measurement is carried out into one of the characteristic vectors of the operator in question, and the 
probabilities with which the different characteristic vectors would result from the measurement are also 
well defined. They are the squares of the scalar products of the normalized initial state of the system on 
which the measurement is carried out and of the operator’s normalized characteristic vector into which 
the state of the system is transformed. It must be admitted, even in this case, that, given an arbitrary 
operator, it is in many cases difficult, in others impossible, to construct an apparatus which can carry out 
the measurement, i.e. the desired change of the state of the system on which the measurement is to be 
carried out. 

But as far as the measurement of a classical function of p and g is concerned, no similar postulate 
exists which can be formulated in classical terms. But Weyl did propose the association of a quantum 
mechanical operator to every function of q and p and defined the measurement of the classical quantity 
as being identical with the above described quantum mechanical measurement of the operator which he 
associated to the classical function of g and p. This association will be described below. What is 
remarkable, however, and what has been first pointed out by Moyal [1949], is the close connection 
between Weyl’s proposal and the distribution function as defined above. In particular, the expectation 
value of the result of the measurement of the operator A, which Weyl associates with the classical 
function A(g, p) if carried out on a system in the state y, 


WlAlwy= | da | dp Puta, P) AGP) (2.30) 
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is equal to the expectation value of the classical function A(g, p) to which A corresponds assuming that 
the system is described by the distribution function P,(g, p) which corresponds to f. This is the content 


of eq. (2.30) and it is valid, as will be demonstrated below, for every state vector % and also for any 
density matrix p 


Tr(6A) = Í dq | dp P.(q, p) A, p)- (2.31) 


Actually eq. (2.31) is an easy consequence of eq. (2.30) and only the latter will be proved below. 
In order to prove eq. (2.30), we start with Weyl’s expansion of A(g, p) into a Fourier integral (taking 
h = 1 for the purpose of this proof): 


Agave | de | dr a(o, r) e m, (2.32) 


Weyl then defines the operator which corresponds to the exponential in the integrand on the right-hand 
side of eq. (2.32) as exp{i(ag + 7p)}. The operator which corresponds to A(g, p) is then given by 


Ät@,B)= | de | Aral) orte): (2.33) 


If we substitute this result for A into the left-hand side of eq. (2.30) and replace A(q, p) on the 
right-hand side by the right-hand side of eq. (2.32), it becomes evident that all we have to prove is that 


(wlexpti(ag + rely) = | dq | dp P,(q, p) expli(og + rp)} 


= 7- | dy | dq | dp w*(q +3y) (q — žy) explipy + iloq + rp)} (2) 


The integration over p gives 27 6(y + 7) and hence the right-hand side of eq. (2.34) becomes 
) dq y*(q - 37) plg +37) e. 


In order to evaluate the left-hand side of (2.34) we note that according to the Baker-Hausdorff theorem 
(Messiah [1961]), if the commutator D = [A, B] commutes with A and B then 


eAtB = eÂ ef e D2 (2.35a) 
It then follows that 


ceria et a (2.35b) 


Hence, the left-hand side of eq. (2.34) becomes 
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ei"? (yle'? e' |p) . 
Next, using the fact that 
e” yx) = W(x + r) | (2.36) 


and transferring the e'’” to the left-hand side, this becomes 
ele! W(x)ly(x + 7) = | dx e+ YW (x) W(x + 7), (2.37) 


which is equal to the expression obtained above for the right-hand side of eq. (2.34). Thus, we have 
proved eq. (2.34) and hence also eq. (2.30). 
In summary, if a classical function 


A(q, p)= Í do | dr eent) a(g, 7) (2.38) 
goes over to the quantum operator 
A(4, p) = | do | dr e WM a(o, 7) (2.39) 


then the relation between A(q, p) and A is that given by Wigner in eq. (2.12). Furthermore, it is clear 
that if, for all A(p, q) 


f aq [ dp Pi. p) AG@p)= | da | ap P'a, p) Ala) (2.40) 


then P’ is identical with P. 
In addition, we mention that under the Weyl correspondence the classical quantity q"p” becomes 


nym 1 $ N\ an-r am ar 
a"p" 50 > (") a8 4 (2.41) 


r=0 


as can be seen by considering the or” coefficient in (og + 7p)"*”. 
Finally, we would like to mention the role played by the characteristic function. This is a description 
of the state 6 by means of a function of two new variables, o and 7, 


C(o, r) = Tr(ß C(a, r)), (2.42) 
where 
Clo, r)= eee) | (2.43) 


Distribution Functions in Physics: Fundamentals 285 


M. Hillery et al., Distribution functions in physics: Fundamentals 135 


Here we are following the nomenclature of Moyal which has now become standard in describing this 
quantity as a “characteristic function”. This description stems from statistical terminology, and, in 
particular, should not be confused with the sometime usage of “characteristic function” in quantum 
mechanics to denote an eigenfunction. 

C(o, T) is just the Fourier transform of P(q, p). To see this we note that the function corresponding 
to C(o, r) is just exp{(i/k)(oq + 7p)}. Making use of eq. (2.11) gives 


Clo, 7)= THA Clo, 7) = | da | ap” Pala p) 0.44) 
so that 
1 \? i 
P.(q, p)= (5) f do f dre INte Coit); (2.45) 


We can use the characteristic function to compute expectation values of Weyl-ordered products of p 
and q. We have that 


oms a CG. T) 


1 00” = = | dq | apa p” Pq, p), (2.46) 


o=17=0 
the right-hand side of which is just the average of the Weyl-ordered product q”p". 
2.3. Dynamics 


We would now like to derive equations for the time-dependence of P,. As before, our detailed 
considerations will be confined to one dimension but some results will also be quoted for the 
multi-dimensional case. The time-dependence of Py may be decomposed into two parts (Wigner 
[1932a]) 


OPa _ Pu Pw 
a a at (2.47) 


the first part resulting from the (i#/2m) 0?/dq? part, the second from the potential energy V/ih part of 
the expression for dy/dt. 
From the definition of Pw, given by eq. (2.2a), it follows that 


= i a7ys* * Š B Zipy/h 
Ae = | gy [a-y ary u]. me (2.48) 


at 


where we have taken advantage of the functional dependence of to replace 9?/dg? by 9*/dy?. Next we 
perform one partial integration with respect to y to obtain 


Pu _ -F | ay heroes plq- y)-y* atn]. 2ipy/ih (2.49) 


ôt 
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since the boundary term does not contribute. Switching back from 9/dy to 0/dq, we finally obtain 


Pu _ _ p OP ’ 
a= ~ , (2.50) 


This is identical with the classical (Liouville) equation for the corresponding part of dP/dt, as was 
mentioned at eq. (2.7). We next calculate 


zn = -a | dy ilVu*g + y) Yla- y)- y*la + y) [Vya - y} er” 


i i 
=e | dy [V(q+ y)- Vla- y) ¥*(q+ y) plg- Wei”. (2.51) 
Assuming that V can be expanded in a Taylor series, we write 
S yt 
Va+y)= X y VOO) (2.52) 
£A! 
where V®X(q)= ð* V/dq’. It follows that 


Pw _ 2 ere | 
| o IH Orava- ner, (2.53) 


where now the summation over A is restricted to all odd positive integers. It is clear that in the powers 
y* in the integrand we can replace y by (4/2i)(d/dp). It then follows that 


Pu _ sp 1 (AY 2? Vla) 9° Pol p) 
ot 2 A! (5) aq’ op»? en) 
A again being restricted to odd integers. An alternative form for 0,P,/dt is given by 
Ow : : : 
= | dj Pap + IG), 0.55) 
where 


Jg j)= | dy [V(q+ y)- Vq- y) e7% 
i Jr | dy [V(g + y)- V(q - y)] sin(2jy/h) (2.552) 


is the probability of a jump in the momentum by an amount j if the positional coordinate is q. The first 
part of eq. (2.55a) may be verified by inserting the Fourier expansion, with respect to y, of V(q+ y)- 
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V(q-y) into eq. (2.51). The second part is obtained by replacing the exponential by cos+isin and 
noting that the expansion in the square bracket is odd so that the integral of the cos part vanishes. 
In the multi-dimensional case where Pw = Py(qi,...4Gn3 P1, . » - Pn), the corresponding results are 


Pu S ar, sa V ae BN Py | 
ot ~ qu Mk 04k + Qu agh vag Al e An! opt!» apr’ (2.56) 


where the last summation has to be extended over all positive integer values of A,,...A„ for which the 
sum Aı + Az+°:: +A, is odd. 

The lowest term of eq. (2.56) in which only one A is 1 and the others vanish, and which has no A 
factors, is identical with the corresponding term of Liouville’s equation. Hence eq. (2.56) reproduces the 
classical (but non-relativistic) equation if A is set equal to zero. The ° terms give the quantum 
correction if this is very small. We will obtain a somewhat similar equation for the 1/T dependence of 
the distribution function of the canonical ensemble, which also is useful if the temperature T is not too 
low so that the quantum correction is small. 

Eq. (2.56) is the generalization of eq. (2.50) and eq. (2.54) for an n-dimensional configuration space. 
The same generalization of eq. (2.50) with eq. (2.55) is 


JA aP, f. | | | 
en DPT [an | dja Pan. dui Ba ji +- Pa + ja) ICi + Gai P+» Pr): (2.57) 
k 


where J(qi,.-- qn; jis ---jn) can be interpreted as the probability of a jump in the momenta with the 
amounts ji,- .. ja for the configuration q;,...q,. The probability of this jump is given by 


Jaan.. Qn ji- ja) = =A dyı | dyn [V(qi + Yi,- -qn + Yn) Vlgi- Vis --- Gn — Yn)] 
wh 
x expi-(2i/h)(yıjıt +++ + Ynja) (2.58) 


that is, by the Fourier expansion coefficients of the potential V(qi,... qn). 

From eq. (2.56) it is clear that the equation of motion is the same as the classical equation of motion 
when V has no third and higher derivatives as, for example, in the case of a uniform electric field or for 
a system of oscillators. However, there is still a subtle difference in that the possible initial conditions 
are restricted. This comes about because not all P(g, p) are permissible (see eq. (2.19b)). 

While we consider that the above form for the equations of motion (Wigner [1932]) are the simplest 
to use in practice, we will now discuss some other forms which occur frequently in the literature. 

Before doing so it is useful to take note of another relation, in addition to that given by eq. (2.23), 
which expresses the Weyl function corresponding to an operator F = AB in terms of the Weyl functions 
corresponding to A and B. This relation was first derived by Groenewold [1946] and was also discussed 
by Imre, Ozizmir, Rosenbaum and Zweifel [1967]. They find that the function corresponding to F is 


AB = F > F(q, p)= A(q, p) e** B(q, p) 
= B(q, pe" A(q, p), (2.59) 
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1-33_83 0.60) 


and the arrows indicate in which direction the derivatives act. Also (0/ap) (d/dq) is considered as the 
multi-dimensional scalar product of 0/dp and ôð/ðq, or, in other words, it is equal to (0/öp;) (8/dq;), where 
i=(l,...n) and n denotes the number of dimensions and, as usual, repeated indices denotes 
summation. 

To derive this result we first note that 


(allg) = | do expl(i/h) alg’ + g"V2} alo, q'- 4"), 2.61) 


where a is defined by eq. (2.32). This result follows from eq. (2.33) by taking the matrix element of both 
sides. A similar result follows for (q"|Blq’) except that a is replaced by £, the Fourier transform of 


B(q, p): 
BG. p) = | do | dr expf(i/h) (0 + 7P} Bla, 7). (2.62) 
We can now calculate F(q, p). We have from eq. (2.12) that 
= amp /„_Z AR z 
F@p)= | dze j (q 5 AB| q+5) 


a rap [a Zl Aly' '| É Z 
faz fage "(4 5| Ale) (edler) 


= | dz | dq’ | do | do’ e fh) o(a'+q—2/2)/2 efit) o'(a'+q+2/2y/2 


' z ’ 1,2 i z 
xa(o.g -q+5)B(o'.q-4 pjene (2.63) 
We now define two new variables of integration 7 = q’— q + (2/2) and 7'= q - q' + (z/2) so that 
F(q,p)= | dr | ar | do | do’ emer alo, newer men gen). (2.64) 


It is possible to replace the exponential factor exp{(i/#) (a't — or')/2} by exp(hA/2i) so that eq. (2.64) 
becomes 


F(q, p) = A(q, p) e^” BQ, p) (2.65) 


i.e. just the first expression appearing on the right-hand side of eq. (2.59). The second expression also 
follows readily from eq. (2.64). 
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We can also make use of eq. (2.64) to find an alternative expression for F(q, p) involving the Bopp 
operators (Bopp [1961] and Kubo [1964]) 


BE LEN -p2 
We first note that 
ilate AN) expli if 9 8 
exp lj GG 2i 7 j r(p tz all = exp; (oq + wp) exp{i(7 ri A (2.67) 
so that 
i = ho hð Gh) (o'q+7'p) — ali/h) loatt) 2 Gi/h) (a’r-or')/2 
exp lj [o(a 2i z) t r(p ta = e =e e (2.68) 


Using this result in eq. (2.64) we then have that 

F(q, p)= Í dr | dr’ | do | do’ ECO g(g, 7) e0 eaP BIg! 7’). (2.69) 
From eq. (2.33) we see that the expression 

A(Q, P)= | dr | do e! 02+) a(o, T) (2.70) 


is just the Weyl-ordered operator Á(å, ) with 4> Q and p— P. A(Q, P) is also an operator but not on 
the Hilbert space on which A(@, p) is an operator; it operates on functions in phase space. We can, 
therefore, express F(q, p) as 


F(q, p)= A(Q, P) B(q, p). (2.71) 


In a similar manner one can show that 


F(q, p)= B(Q*, P*) A(q, p), (2.72) 
where 

._,.,89 en. 

etz P= Poa aq ere) 


It is now possible to make use of the fact that the Wigner distribution is the function which is 
associated with (1/27h)p. The equation of motion for is just 


ih ößlör= [H, 6]. (2.74) 
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This implies that we have for the Wigner function 
ih öP,löt = H(q, p) e^” P.(q, p)- Polg, p) e*"”" A(q, p) 
or 
h aP,,/at = -2 H(q, p) sin(hA/2) P.(q, p), (2.75) 


where H(q, p) is the function corresponding to the Hamiltonian operator for the system, Ê. Actually, 
this is an abbreviated form of eq. (2.56) as can be verified by expanding the sin into a power series. Note 
that if we take the A > 0 limit of this equation we obtain the classical Liouville equation 


AP$ Jt + {P$ H}=0, (2.76) 


where {} denote Poisson brackets and the superscript c on Py, indicates the classical limit. For an 
H (q, p) which is at most quadratic in q and p, e.g. a free particle or an harmonic oscillator, eqs. (2.75) 
and (2.76) coincide. In these systems, then, the difference between a classical and a quantum ensemble 
is the restriction on the initial conditions in the case of latter (cf. eq. (2.19)). 

We also want to quote two alternate forms of eq. (2.75). The first follows immediately from our 
discussion of the Bopp operators. We have, using eqs. (2.65), (2.71), (2.72) and (2.75), that 


ih óP„lðt = [H(Q, P)- H(Q*, P*)] P.(q, p), (2.77) 


a result first obtained by Bopp [1961]. Analogous to the definition of A(Q, P), given by eq. (2.70), 
H(Q, P) is the Weyl-ordered operator with 4> Q and 5>P, where Q and P are defined in eq. (2.66). 
These equations do not exhaust the possible formulations of the dynamics of the Wigner function. One 
can also make use of propagation kernels. This approach is discussed by Moyal [1949] and Mori, 
Oppenheim and Ross [1962]. 

We turn now to a consideration of a canonical ensemble. If 6 = 1/kT where k is Boltzmann’s 
constant and T is the temperature, then the density matrix of the canonical ensemble is 


ee ee, (2.78) 


and Z(ß)= Tr(e~®"). The unnormalized density matrix, 2, then satisfies the equation 


aap = -AÂ =-OH, (2.79) 
subject to the initial condition AB =0)= Î where Î is the identity operator. Eq. (2.79) is referred to as 


the Bloch [1932] equation for the density matrix of a canonical ensemble. Using the product rule given 
by eq. (2.59) we have that 


ôN (q, p) ab = -H (q, p) e** Q(q, p)= -H(q, p) e 2G, p), (2.80) 


A being given by eq. (2.60) so that 
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62(q, pap = - H(q, p) cos(hA/2) N (q, p). (2.81) 


This is the Wigner translation of the Bloch equation, which was entensively studied by many authors 
and was first derived in this form by Oppenheim and Ross [1957]. It is useful in the calculation of 
quantum mechanical corrections to classical statistical mechanics. The initial condition for this equation 
is just the Wigner function corresponding to 2(8 = 0)= 1. Inserting J in eq. (2.12) we find that the 
initial condition is just 2(q, p)|gao = 1. 

It is also worth noting that P(g, p) does not satisfy the Wigner translation of the Bloch equation 
simply because of the fact that it must be multiplied by the 8-dependent factor (27h) ZB)i in order to 
obtain 2(q, p). 

Finally, we emphasize that all equations from eq. (2.59) onwards hold in the multi-dimensional case, 
where we simply interpret (q, p) to be (q;,... qn; Pis--- Pn) and the simple products in the exponents as 
scalar products. The solution of eq. (2.81) in the multi-dimensional case, is to order A? (Wigner [1932a]), 


aape ooie |S (EEE (XS) 5 Fam UN. om 


8m, Ogi 24m, Ik Kl 24m; m; 04x. 0G: 


Actually, the Wigner translation of the Bloch equation, eq. (2.18) above, can be simplified further into 
a form, analogous to that of eq. (2.56), which is more convenient for applications. This is achieved by 
writing the cos term as the real part of the operator 


Amex RE 9 
6 = exp > (3 a a (2.83) 


where we have used the explicit form for A given in eq. (2.60), again noting that the arrows indicate in 
which direction the derivatives act and that the gradient operators are 3N-dimensional. Next we 
decompose 0 by means of the Baker-Hausdorff theorem (eq. (2.35a)), and using the fact that 


ð ð : 
ap oq H(q, p)=0 1) 


it follows that we may write 


S= exp #2 2) exf- ih oO 
= exp|5 alt exp| rae ee) 


where we have neglected terms which do not contribute in the present context. Again because of eq. 
(2.84), and also using the fact that we are only interested in the real part, it follows that the only terms 
in 0 which contribute are 


ih ð J) Wd 8 6 d 
exp| - 2 rs 8 ap; dp; ðq: 29; ee) 


where i,j =1,...n (and as usual, it is understood that (8/0q)(0/dp) stands for (ð/ðq:) (8/3p;)). From 
henceforth, we will assume that we are dealing with a system of (n/3) identical particles of mass m. 
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Hence, since H = (p?/2m) + V, it follows that 


Eaa in 6 0 ER, P 
H0.0= {exp| - ae n £la, (2.87) 


where it is to be understood that the (ö/ög) term in _the exponential operates only on V and not on 2 
(whereas the ö?/äg? term operates on 2). Also, the 4/ap term has no effect on V and thus operates only 
on N. Since all arrows now operate to the right, they will be omitted from henceforth so that we finally 
obtain 


ap) [2 hô 4 ai] 
oß m" 0085 3 ôq 2) 4 8m aq? g (2.88a) 
(ur 22) ve 2) 
=| H+2sin E aq z) aa aq? R, (2.88b) 


where the d/dq term in the cos and sin terms is to be understood as operating only on V. Such a form 
was given for the first time by Alastuey and Jancovici [1980] and, in fact, their result also takes account 
of the presence of a magnetic field. We recall that (9/9p) (d/dq) is considered as the multi-dimensional 
scalar product of d/dp and ð/ðq, or, in other words it is equal to (0/dp;) (d/dq;) where i goes from 1 to n 
and n denotes the number of dimensions. Hence, the explicit form of eq. (2.88a) is 


n. 2 2 ALFA tAn gat tAn 7 Alt +An 
ap. fpe E E a + cary ie A a os 


op 2m 8m aq; 3 À 


Atı.!- An | dqr- bq ôq?" ap? ph 
where the last summation is to be extended over all positive integer values, as well as zero values, of 
Ai, A2,...An, for which the sum A, + A.+-+--+A, is even. This form for the Wigner translation of the 
Bloch equation is the most convenient from the point of view of applications. 

One of the earliest applications of these results was to the quantum corrections of the classical 
equations of state and to similar corrections to chemical reaction rates (Wigner [1932b, 1938]) and they 
have been extensively used in statistical mechanics (Oppenheim and Ross [1957]; Mori, Oppenheim and 
Ross [1962]; Nienhuis [1970], for example). However, we will defer a detailed discussion of applications 
to Part II of our review, to be published at a later date. 


2.4. An example 


We would now like to use some of the formalism which we have developed to actually calculate some 
distribution functions. The system which we will consider is the harmonic oscillator and we will consider 
both pure and mixed states. We will find the Wigner functions corresponding to the eigenstates of the 
harmonic oscillator and also the function corresponding to a canonical ensemble of harmonic oscillators 
at temperature T. 

The eigenstates of the harmonic oscillator are (Landau and Lifshitz [1965}) 


u.) = (E) (ir) ev? Hla), (2.90) 
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where A, is the nth Hermite polynomial and a = (mw/h)"?. Substituting this expression into the 
definition of the distribution function, eq. (2.2a), we find that 


Ula + y) Un(q- y) = es A exp{-a7[(q + yY + (4 - yY)/2 Halala + y)): Halala- y)) 


(2.91) 
so that 
= dl a 1 ee? 2ipy/h ea’? 
Pula, p)= Ge ee | dy e e Hulatq+ y)) Hala(a - y). (2.92 
We now note that 
a*y? — 2ipy/h = a?(y - ipla*h) + p?/a7h? (2.93) 
and define a new variable 
z=a(y—ipla*h). (2.94) 
We then have that 
Pa. P)= Ge nn he eno ef’ faz e* H,(@ag+z+ß)H,(agq-z-Bß), (2.95) 
where ß = ip/ah. Noting the H„(-x) = (-1)" H, (x) we find 
P«(9,P) = sa ae ur | dz e””’ H.(ag+z+ß)H,(z+ß-adg). (2.96) 
The above integral can be done (Gradshteyn and Ryzhik [1980]) and is 
| dz e"? H,(z + B + aq) H,(z + B - aq) = 2" Vr n! L,(2(a2q?- BP), (2.97) 
where L, is the nth Laguerre polynomial. Re-expressing a and ß in terms of q and p we have 
2 = A P_ 2\_ 2 
a%g?- pr= 2 (Z+ i ma?) =~ H(g,p) (2.98) 
so that (Groenewold [1946]; Takabayaski [1954]; Dahl [1982]) 
P.(q, p) = (A/mh) (-1)" e? L, (4H/ho). (2.99) 


Before discussing this result we will first calculate the distribution for an ensemble of oscillators at 
temperature T (Imre, Ozizmir, Rosenbaum and Zweifel [1967]). Here we proceed by way of the Wigner 
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translation of the Bloch equation (eq. (2.88b)) which for this system results in 


ap) | pi i. 2 hð ð Ai) 
BB (di + 2 mag?) +2 sin G 34 le ag N. (2.100) 


Because V is quadratic in q? it is clear that only the leading order term in the sin? expansion will 
contribute, and since 0? V/dq? = mw’, it follows that the Wigner translation of the Bloch equation for 
the oscillator reduces to 


bN (P 172, N 

aB (+3 t3 2 mw*q jatt Ta 8 (+ aq? us 20°) i (2.101) 
To solve this equation we make the Ansatz 

N(q, p)= exp{- A(ß) H + B(B)} (2.102) 


where A(0)= B(0)= 0, and H = (p?/2m)+ 3mw7q’. Substituting this into eq. (2.101) gives us 


dA py} dB h [i gA- 2924? ALB) 42 
(- gs mee -Ha+ [L (-mA + maq?A?)+ ma? ( +E) aja 
2 . 
= -HN+ Coy (-A+HA?)N. (2.103) 


This equation can be re-expressed in the form 


Hap) |- $+- GeF 42] 4 aa Ger a]=0. (2.104) 


Because this equation must hold for all q and p, and the terms in the brackets are independent of q and 
p, they must vanish independently, i.e. 


dA „(Aw ai 

FT 2 A’-] (2.105) 
dB „(Aw i 

FEB 7 A=0. (2.106) 


Eq. (2.105) can be integrated directly. One has that 


— up — an =Í dg (2.107) 


Or 


lea) es 
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Inverting this equation gives us that 
A(8) = (2/hw) tanh(ftwB/2) . (2.109) 


This can now be substituted into eq. (2.106) to give 
B 
B(B)=- te | dp’ tanh ("SE ) =-I cosh (*2£) (2.110) 
J | 


Therefore, we have 
(q, p) = sech(hwB/2) expl- (2/hw) tanh(hwB/2) H(q, p)]. (2.111) 
To complete our derivation we need to normalize the above expression. As was noted before the 


Wigner function is the function which corresponds to the operator (6/27h). From eq. (2.78) we then 
have 


Pa. P)= 33 218, 24?) (2.112) 


as {2(q, p) is just the function corresponding to e-®R, We also have from eq. (2.11) (setting A = e°” 
and B= I) 


Zip) = Tree?) = 4, | daf dp 2a p). (2.113) 


Substituting eq. (2.111) into eq. (2.113) we find 
Z(B) = Hsinh(hop/2)]" . (2.114) 
Finally we obtain for P,(g, p), from eqs. (2.111), (2.112) and (2.114), 
P.(q, p) = (L/Th) tanh(twB/2) exp[-(2/hw) tanh(hwß/2) H (q, p)) . (2.115) 


We now want to compare the two expressions (eq. (2.99) and eq. (2.115)) for Pw for the pure and 
mixed states, respectively. Examining the first few Laguerre polynomials 
Lo(x) =] 
L(x)=1-x (2.116) 
L»(x)=1-2x+x? 
we see that for the ground state of the oscillator P(g, p) > 0 while for excited states P,(g, p) can assume 
negative values. The result for the canonical ensemble, however, is always positive. It does not have the 


oscillatory structure which is present in the expressions given by eq. (2.99). The incoherence induced by 
a finite temperature leads to a much smoother distribution function. 
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2.5. Statistics and second-quantized notation (Klimintovich [1958]; Brittin and Chappell [1962]; Imre, 
Ozizmir, Rosenbaum and Zweifel [1967]) 


When one is dealing with more than one particle one has to include the effects of quantum statistics. 
To illustrate how these effects come in to the Wigner function we will first consider an example. We will 
then show how the Wigner function can be expressed in second-quantized notation. In this form it is 
easier to take the effects of statistics into account, but two of us have an article in preparation 
(O’Connell and Wigner [1983]) which not only will take the effect of statistics into account, but will also 
include spin effects. 

Let us consider two identical particles in one dimension in a harmonic potential well. We will further 
assume that the particles are bosons. The Hamiltonian for the system is 


A 


ae Een 
= (Bit pa) + imow (G+ 43). (2.117) 


Suppose that we want to find the Wigner distribution for a canonical ensemble of these systems at a 
temperature 7. We would again like to use the Wigner translation of the Bloch equation but now we 
must be more careful; the initial condition is no longer so simple. 

To see this we first find the density matrix for the system. The eigenstates of the Hamiltonian given 
by eq. (2.117) are 


5 (Unl4s) Unda) + Un(Gs) Una) if ms > m 


ban, q2) = (2.118) 
U„,(91) Un.(42) if ny = Mm, 

where U,„(g) is given by eq. (2.90). This state has an energy E,„ given by 

Eu = hw (nı + Not 1) ; (2.119) 
The unnormalized density matrix for this system is just 

A = > exp(-BEnm) lbnim) (Pnn . (2.120) 

nızn2 

In the 8 > 0 limit this becomes 

M(B =0)= I løn) nml- (2.121) 


nizn2 


Taking matrix elements we find 


(qi, 42,2(B = 0)\qa, g2)= E, HU„(g;) Unli) + Uni) Un(a) 


ni>n2 


x (Um (41) Un.(g2) + Un.(qı) Un(g))+ > Un (q1) Un (43) U* (q1) UX (aa) 


= 2 > (U,.(41) U„.(q2) U a(g) U m(42) + Un (q1) U„(42) U (92) U „.(qı)) $ 
nin? 
(2.122) 
We can now make use of the identity 


Distribution Functions in Physics: Fundamentals 297 


M. Hillery et al., Distribution functions in physics: Fundamentals 147 
> U,(qi) Už) = ö(qı - q1) (2.123) 

to give 
(qi, RB = Oq, 92) = Hölgi - 91) 8(95- G2) + Ölqi- 4) 8(43- q)]» (2.124) 


as was to be expected. If we operate on an arbitrary two particle state, |W), with AB = 0) we have that 


(qi, 908 = Ol) = | dg: | da; (qi, 451648 = Ola, qd (au, gal? 


= 3[y(q1.92)+ Yla 91) - (2.125) 


If y is symmetric the result on the right-hand side of eq. (2.125) is y, if y is anti-symmetric the result is 
0. 

Therefore, A(ß = 0)is just the projection operator, Ps say, onto the state of symmetric two-particle wave 
functions. This result is also true for an arbitrary number of particles, N. Our result that SB = 0) is Ps was 
derived for bosons. Similarly, if the particles are fermions (6 = 0)i is P,, the projection onto the space of 
anti-symmetric N-particle wave functions, but in this case, the spin variable should also be included. 

Returning now to our example we want to find the initial condition for the Wigner translation of 
Bloch equation, i.e. we must find the function corresponding to Ps. Making use of the two-particle 
extension of eq. (2.12) we find 


Nqı, 92; Pi, P2) = f dyı | dyz exp{(i/h) (Pry: + P2 Y2} (qi -3y1,92-byalPalgı + dy, G2 + 3y.) 


2 f ig; | dyz exp{(i/h) (p1 yı + P2y2)} (y1) ö(y2) 


+ ö(92- qı +3(y2+ yı)) (4: - Gat 3(yı + y2))] 
=3+ mh 8(qi- q2) 5(pi— Pa) - (2.126) 


The corresponding result for fermions has a minus sign in front of the second term. This initial 
condition is considerably more complicated than the initial condition, 2(q, p)= 1, which was obtained 
in the one-dimensional case. The situation rapidly becomes worse with larger numbers of particles. 

Second-quantized notation provides, in principle, a convenient way to deal with the problems 
imposed by quantum statistics. We will consider a Fock space and designate the vacuum state of this 
space by |0), and the quantized field operators at the point r by ý*(r) and ir). The interpretation of 
the field operators is that #*(r) adds a particle at point r to the system whereas ý(r) removes a particle 
at point r. They are defined as 


ý(r)= 2 7 e? â, (2.127a) 


ý*(r)= 2 Ze "a; (2.127b) 
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where the so-called annihilation and creation operators, â, and â}, respectively (discussed in detail in 
section 4), act to remove or create a particle of momen p in a box of volume V. For bosons these 
operators obey the commutation relation 


[V(r), Hr] = Pr- r) (2.128a) 

[ý(r), ġir) = 0 (2.128b) 
and for fermions the anti-commutation relation 

Wr), p'r h= 6(r - r') (2.1292) 

Wr), do) = 0. (2.129b) 


To every N-particle state |Wy) in the Fock space corresponds an N-particle wave function given by 
(Schweber [1961]) 


1 2 x 
Pole. Pn) = Ze Mi lr Pw). (2.130) 
The distribution function for the state |Y,) then, is given by 


1 \3N 
P(r... fn; Pi,--.Pn)= (3) [ey [ey expt@/A)(pı'yıt"+pn"yn)} 
x rl Yılrı-3Y1,... NEN) 
3N 
Gar Sn ner: | Eyn exp{(i/A) (pit yı ttt + Pw yn)} 


x peri 3m)‘ kr -3yı)| ÝN) (Puli (ry +3yı)° urn + 5yn)I0) . 


(2.131) 
This expression readily extends to N-particle density matrixes, py, so that 
3N 1 a 
P(r, ... tw; Pi... Pw) = (=) wi) Py: | Pyy exp{(i/h) (pic yi +++: + Pr yn)} 
x i —3yn)°°° Wr, = 5y1) Pn ý (r + 591) ree U" (rn + 3yn )|0) , 
(2.132) 
where, in the case of a pure state, 
Pn = Øn) (Dy | (2.133) 


with |®,,) denoting the N-particle ket basis vector. An N-particle density matrix has the property that if 
Pr: and Ëy- are N'-particle and N”-particle states respectively, then (Dyrjön|®n’) = 0 unless N’ = N” = 
N. Therefore, eq. (2.132) can be expressed as 
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1 \3N 
P(r... FN; Pi. - - PN) = (55) [er [Em exp{(i/h) (pit yi + +++ + Py Yn} 


x Tr(i(rw 398) br — 391) Bw O(n t By) W'(tw + 3yw)) 
LN g : 
= (55) Si dyi: | d’yn expl(i/t) (prey + -7+ Pw * Yw)} 
x Tr(pn ur, + 2y1) n ý’ (rn + dyn) brn z ZYN) Jei Wr, -3y1)). (2.134) 
This is the desired expression for the Wigner function in second-quantized form (Brittin and Chappell 
[1962]; Imre, Ozizmir, Rosenbaum and Zweifel [1967]). 
It is also possible to derive expressions for the reduced distribution functions in terms of the 


quantized field operators (Brittin and Chappell [1962]; Imre, Ozizmir, Rosenbaum and Zweifel [1967)). 
The distribution function of order N, reduced to the jth order, is defined as 


Piru..-r;Pu---Pj)= | i | d’rn | ce | d’pn P(ri,... FN; Dis --- PN) (2.135) 


and this definition will be used for the rest of this section. This can also be expressed, by making use of 
eq. (2.134), as 


1 \3" ] 
Pr... Fj; Pry +» - Pi) = (=z) anf Ena [on [en [em [en | ys 
x exp{i(pie yi t+ °° + Pw -yn VA} Tr(önb*(ri +3yı)°° ý (rN +3yn) (rn -3yn)' Wr, -3yı)) 
1\3 1 an 
= (>) [ern | Brn | dyi: [ay expfi(pisyi t+ + Pi YA} Trlön "(ry + 292) 


o W"(r, + by) = (rn) urn) j ren) ur, = 3y) Hr ur, = 3yı))- 


(2.136) 
In order to analyze this expression further we first note that 
| Prý‘ d(ir)=N, (2.137) 
where N is just the number operator. We then have that, for both bosons and fermions 
lýt), N]= Hr). (2.138) 
Therefore, 
| Erden) NU (rw_1) = N (N-1) (2.139) 


and 
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[era [ern Hee) PO) N Bier) = NND -N FFD. 
(2.140) 
Eq. (2.136) becomes 


1 \F 1 ; 
Pi(ri, ... Fj; Pas --- Pi) = (=) wil Pues f ey expli(piyıt +++ p yA} 
x Tr(ôný (r+ iy) Éti) (Ñ -1 (NAN ++ ilr- iy) Ori y). 


(2.141) 
Because py is an N-particle density matrix we have that 


N(N-1)---(N-N+j+ Dlr -iy dr — yn) by =N- ól- by) Hr — 3) n, 


(2.142) 
so that our final expression for the reduced Wigner function is 
1 \ (N-—j)! . 
P;(ri, -< Ti Pas + + Pi) = (zzi) OAE f ay fey expti(pis yi t+ + py yA) 
X Tr(6n Ge +3yı)° U(r, +3) ý(r; sgy) ur: —3y1))- (2.143) 


It is now possible to formulate the dynamics of this theory in a way which is independent of the 
number of particles. We first go to the Heisenberg picture in which the field operators become time 
dependent. We then consider the operators 


A LP DA : 
Blt np (Fag) d’yı | By, expli(prs yi t + py y\/h} 
tt; Oo blr; t). (2.144) 


The distribution functions for an N-particle theory are then just 


, N-j)l.. ¢ 
P;(r,, eee Dj Pih- Tone, ...T,Pı,-- .Pj)) (2.145) 


We see that in this formulation all of the dynamical information is contained in the operators Ô, which 
contain no reference to a specific particle number and also contain the information about the statistics of 
the particles. Thus, in principle, the second-quantized formalism should be a useful starting-point for 
the incorporation of statistics into problems involving a system of identical particles. However, it must 
be admitted that -to our knowledge - no application has been made along these lines. 


3. Other distribution functions 


We now want to examine certain other distributions besides the one considered so far. These may 
arise out of a desire to make use of an operator ordering scheme other than that proposed by Weyl or a 
desire to have a distribution function with certain properties. For example, we may want to make use of 
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symmetric ordering 
q” p” >3(4"p" + frå"), (3.1) 


in which case we would use the distribution function (Margenau and Hill [1961]; Mehta [1964]) 


PAG, P)= zug Re {vO | ay e yra-y)}- 62) 


On the other hand, we may want to consider a distribution which is always greater than or equal to 
zero. We will discuss a distribution which has this property shortly. 

A scheme for generating distribution functions was proposed by Cohen [1966] and further examined 
by Summerfield and Zweifel [1969]. They give the rather general expression 


1 \ Š 
P,(q,p) = (=) Í do | dr | du exp{-(i/A) [o(q - u)+ tpl} g(o, r) y (u - 5) y(u + 5) (3.3a) 
= | dq’ | dp' 8(q- q', p - p') P(g’, p’) (3.3b) 
for the distribution function of the pure state ı,(q), where 


#(a,p)= | do | dr expl-(AXog + Pl} glo, 7). 6.4) 


Thus the function P is simply the original function P,, smeared with another function g. The basic 
requirement which leads to eq. (3.3) is that P transform correctly with respect to space displacement, 
W(q)> b(g- a), and transition to a uniformly moving coordinate system, #(q)—> exp(—imugq) (q). 
These requirements were formulated in giving the form eq. (3.3a) to P,-and the satisfaction of the 
requirements can easily be verified; eq. (3.3b) then follows. 

Cohen also pointed out that it is possible to obtain distributions whose dependence upon the wave 
function of the system is other than bilinear simply by choosing g(o, r) to depend upon w(q). For 
example, one can choose 


8 (0, 7)= | dq v(q = oF) y*(a j 905) (3.5) 


where qo is an arbitrary value of q. This choice for g(a, 7) satisfies g(0, r)= g(o,0)=1 so that the 
correct marginal distributions are obtained. On the other hand, we now have the rather awkward situation 
that the function-operator correspondence depends upon the wave function. An even simpler choice is, of 
course 


P,(q, p) = (ah) |W(Q)P? |o(p)? (3.6) 


where ¢(p), the Fourier transform of (q) is defined by eq. (2.14). The conditions on (q, p) which must 
be satisfied so that the correct marginal distributions are obtained are 
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| da 88, p)= Ort) (p) (3.72) 
| dp 34,0 = Crh? èla). (3.76) 


One choice of £(q, p) which does not satisfy eqs. (3.7) but which is interesting nonetheless is given by 


pap a) = (Jee orm, 3.8) 


The use of this smearing function was first proposed by Husimi [1940] and has been investigated by a 
number of authors since (Bopp [1956]; Kano [1965]; McKenna and Frisch [1966]; Cartwright [1976]; 
Prugovecki [1978]; O’Connell and Wigner [1981b]). It leads to a distribution function, Py(q, p), where 
the subscript H denotes Husimi, which is non-negative for all p and q. One can see this by noting that 
&(q - q', p— p’; a) is just the Wigner distribution function which one obtains from the displaced (in both 
position and momentum) harmonic oscillator ground state wave function 


Wap (q's a) = (way aa ica (3.9) 


which we will call P,, (O’Connell and Wigner [1981b]). If the Wigner distribution in question, P,, 
corresponds to a wave function ġ(q) we have 


Pala, p)= | da’ | dp’ Paola’, P) Pala’, p) = (sq) || da uzan og) =O, 6.10) 


where we have used eq. (2.8). Note that in order to get a positive distribution function we had to violate 
condition (ii) on our list of properties of the Wigner function. Property (vi) is also violated as was shown 
by Prugovecki [1978] and by O’Connell and Wigner [1981a]. 

We will encounter Pu(q, p) again in the next section in a somewhat different form. It is the “Q” or 
“anti-normally-ordered” distribution function of quantum optics. It is one of a number of distributions 
which are useful in the description of harmonic oscillators, and, hence, modes of the electromagnetic 
field. We now proceed to examine these distribution functions. 


4. Distribution functions in terms of creation and annihilation operators 


The harmonic oscillator is a system that is ubiquitous in physics, so that it is not surprising that 
quantum distribution functions have been developed which are tailored to its description. It is in the 
description of the modes of the electromagnetic field that these distribution functions have found their 
widest application. 

It should be emphasized that many problems in quantum optics require a fully quantized treatment 
not only of the atoms but also of the field. For example, an analysis of experiments dealing with photon 
counting or a derivation of the fluctuations in, intensity of a laser near threshold both require the 
quantum theory of.radiation (Scully and Lamb [1967]; De Giorgio and Scully [1970]; Graham and Haken 
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[1970]). The latter is developed within the framework of annihilation and creation operators for bosons (see 
below) but it is then possible to go to a description in terms of c-numbers (while fully retaining the quantum 
aspects of the situation) by means of distribution functions. In most cases, this greatly facilitates the 
calculation while, at the same time, it contributes to a better understanding of the connection between the 
quantum and classical descriptions of the electromagnetic field. 

A number of studies of these distribution functions have been done (Mehta and Sudarshan [1965]; 
Lax and Louisell [1967]; Lax [1968]; Cahill and Glauber [1969]; Agarwal and Wolf [1970]; Louisell 
[1973]). We will rely most heavily upon the papers by Cahill and Glauber [1969] in our treatment. Their 
discussion considers a continuum of possible operator ordering schemes, and hence distributions (an even 
larger class is considered in Agarwal and Wolf [1970]) but we will consider only three of these. A final 
section will discuss distributions defined on a 4-dimensional, rather than a 2-dimensional, phase space. 

We will describe the system in terms of its annihilation and creation operators 


gs (se) (ag+ ô) (4.1a) 

= (8) lain) aw 
satisfying 

[â â*]=1. (4.2) 


As mentioned before, it is assumed that the field operators we consider obey Bose statistics. Each 
pair of â, ä* refers to a certain function of position. These functions form an orthonormal set which is 
countable if the basic domain is assumed to be finite, and continuous if infinite. We deal with a very 
large, but finite, system so that the system is only approximately relativistically invariant (exact 
invariance is achieved for an infinitely large system, but this would make the calculation in other ways 
difficult). 

The various functions of â and â* are investigated individually because the corresponding â and d* 
do not interact with the â and â* of another member of the set. They interact with the matter which is 
in the basic domain. Thus, for example, when we apply this formalism to the case of the electtomag- 
netic field, we investigate each mode (corresponding to a definite momentum and definite direction of 
polarization) separately, and the operators associated with different modes commute (no interaction 
between modes). In addition, there will be a distribution function corresponding to each mode. 

The a and â* operators act on the basis vectors |n}, the so-called “particle number states”, and have the 
properties: 


G|n)=Vn|n-1) (4.2a) 
a@* |n)=Vn+1|n+1) (4.2b) 
G*a\|n)=n|n) (4.2c) 
4|0)=0. (4.2d) 


In addition, one can prove that 
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[4, (a*)"]=n(a*y"". | (4.2e) 


If we are considering an oscillator of mass m and angular frequency w we take A = (mw)'” and if we are 
considering a mode of the electromagnetic field of angular frequency w we set A = (A? w/c). 

We also want to consider a special class of states known as coherent states (Schrodinger [1926]; 
Glauber [1963a]; Glauber [1963b]; Sudarshan [1963]; Glauber [1965]). To define these we first define 
for each complex number a the unitary displacement operator: 


D(a) = elaä’-a”d) — e ler erat aa" (4.3) 


where the last expression is obtained by use of the Baker-Hausdorff theorem (eq. (2.35a)) and the 
commutation relation given by eq. (4.2). The operator D(a) has the property that 


D"'(a) 4 D(a)= data (4.4a) 
D(a) d* D(a)= â*+a*. (4.4b) 


The proof of eq. (4.4) readily follows from eqs. (4.2e) and (4.3). We now define the coherent state 
(Glauber [1963a]; Glauber [1963b]; Sudarshan [1963]), which we denote by |a}, as 


la) = D(a) 0) = e7”? > (a*)" |) = er S gial), (4.5) 


n=0 


where |0) is the ground state of the oscillator. This state has the property that it is an eigenstate of the 
annihilation operators with eigenvalue œ. Again, this can be verified by using eq. (4.2e). Perhaps it 
should be emphasized that the symbol a always refers to a complex eigenvalue whereas |a) always 
denotes a state, just as n denotes a real eigenvalue and |n) a state, the so-called “number state”. Also, 
just as |n) refers to a definite state of excitation of a system of one mode, |æ) also refers to a state of one 
mode. 

The |a) states are not orthogonal but they are complete (in fact overcomplete). Explicitly, 


(Bla) = exp[-3(la|’ + |B?) + B*a], (4.6) 


which follows immediately from eq. (4.5) and the fact that the number states are orthonormal. 
Furthermore, it is possible to express the identity operator as 


I= =| da |a) (al, (4.7a) 


where d?a = d(Re a) d(Im a) = 3da da*. The proof of eq. (4.7a) follows by setting a = r e°, so that 
da = r dr d0, and then using eq. (4.5) to get 


* [ee la) (a| => 5 Daye ca er le i(m -n)6 dé = | dr er ar 


T m=0 n=0 


(4.7b) 
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where we have used the fact that the angular integral simply equals 276,,,. The latter radial integral 
equals n!/2, so that using the fact that 2, |n) (n| = 1, eq. (4.7a) readily follows. A direct consequence of 
eq. (4.7a) is that the trace of any operator A is just 


Tr(A) = =| Pa (a|Ala). | | (4.8) 


It is also of use to compare the expression for the displacement operator D(a) to our previous results 
and use this comparison to derive an expansion for a general operator A in terms of D- (æ). This will 
be of use later. First we note that if we set a = (2h)""? (Ar + iA "'o) then (see eq. (4.1)) 


D(a) = exp{(i/h)(og + tp} = Clo, 7), (4.9) 


where C was defined earlier by eq. (2.43). Thus from eqs. (2.42) and (4.9) it is clear that the 
characteristic function is the expectation value of the displacement operator. This in conjunction with 
eqs. (4.5), (4.6) and (4.8) gives 


Tr(D(a) DB) = 7 8% (a - B), (4.10) 


where the ô function here is 57(¢) = ö(Re £) (Im £). Suppose that we can expand the operator A(, ä*) 
as 


=< | Pes) De. (4.11) 
Using eq. (4.10) we find that 


g(&) = Tr(A D(é)). (4.12) 


It can be shown (Cahill and Glauber [1969]) that if A is Hilbert-Schmidt (i.e. Tr(Á* Á) < ©) then the 
function g(£) is square integrable. 

The three types of ordering of the operators â and ä* which we wish to consider are defined as 
follows: 

(i) Normal ordering- A product of m annihilation operators and n creation operators is normally 
ordered if all of the annihilation operators are on the right, i.e. if it is in the form (â*)" â". 

(ii) Symmetric ordering- A product of m annihilation operators and n creation operators can be 
ordered in (n+ m)!/n! m! ways. The symmetrically ordered product of these operators, denoted by 
{(ä*)” a}, is just the average of all of these differently ordered products. For example 


{a* 4} = 4(4* a+ ââ*) (4.13a) 
{â+ â} = 4(G* G2 + Ga* G+ Ga") (4.13b) 
(a*?a?} = (4°74? + G* Ga* G+ G* G?G* + âât â + da* Ga* + 4747”). (4.13c) 


(iii) Anti-normal ordering- A product of m annihilation operators and n creation operators is 
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anti-normally ordered if all of the annihilation operators are on the left, i.e. if it is of the form å” (ä*)". 

For each operator ordering we have a rule which associates a function of a and a* with a given 
operator. The rule is as follows: for any operator ordering scheme the product of m annihilation and n 
creation operators, ordered according to that scheme, is associated with the function (a*)"a@™. For 
example, if we are considering normal ordering the product (4")"a™ is associated with (a*)"a”™; if 
anti-normal ordering is being considered then â” (@*)" is associated with a”(a*)”. We will now make 
the meaning of our rule more explicit by considering each of these orderings and its associated 
distribution function. 


4.1. Normal ordering 


Let us suppose that we can expand a given operator A(4, 4*) in a normally ordered power series 


A= > Cm yâ". (4.14) 


n,m =0 


Let us further suppose that we can express the density matrix as 
je | Pa P(a)ja) (al (4.15) 


where P(a) is a c-number and the state |æ) is given by eq. (4.5). P(«) is called the P-representation of 
the density matrix (or the distribution function representing the density matrix) of the particular mode 
under study. It should be emphasized that both the real and imaginary parts of a are used as the 
variables of the distribution function. Also, it is probably worthwhile mentioning again that our 
discussion is restricted to a system of bosons and thus the distribution functions under study are not 
applicable to, for instance, a gas of neutrinos. Also, we are dealing with a very large but countable set 
since we assumed that the basic domain is finite. 
From eqs. (4.10) and (4.7a) and because (a|4*"4"|a) = a*"a”, it follows that 


Trtäß)= | PB | a Pia) (BlAla) (alp) = | Pa Pla) al Ala) 


| = | a P(a)| > com(a*yra” |= | Pa P(a) Anla, a”), (4.16) 
with 
Ax(a,a*)=(alAla)= I cmla*a”. (4.17) 


Therefore, we associate the operator Ä(ä,ä*) with the function Ana, œ*) in the evaluation of 
expectation values with the P-representation. 

We now want to derive two expressions for P(a) in terms of the density matrix. It is not always 
possible to find a useful representation of 6 of the form given by eq. (4.15). For some density matrices 
P(a) would have to be so singular that it would not even be a tempered distribution (Cahill [1965]; 
Klauder and Sudarshan [1968]). This difficulty will be apparent in our formal expression for P (a). 
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Let us now choose for the operator A 
A = eff" eE, (4.18) 


The corresponding function is then An(a, a*) = exp(£a* — é*a). Inserting these expressions into eq. 
(4.16) we find that 


xn(&) = Tr(p e® et?) = | d’a P(a) eE", (4.19) 


The function xn(£) is known as the normally ordered characteristic function. The right-hand side of eq. 
(4.19) is just a Fourier transform in a somewhat disguised form. In fact one has that if 


f(a)= + [ Beet) (4.202) 
then 


FEI | Pa ef f(a), (4.206) 


and vice versa. Therefore, we have for P(a) 


1 ass 
P(a)= -3 | Een). (4.21) 
The problem with this expression is that xn(£) can grow rather rapidly. In fact we have that because 
exp(éa” - £*ä) is unitary 
Inte) = Tr ef Ea] < elf? (4.22) 


which suggests the type of behavior which is possible. For example, if 6 = |n) (n|, where |n} is the 
eigenstate of the number operator with eigenvalue n, then for large |é| we have |yn(€)| ~ |é|?". This 
representation, then, is not appropriate for all density matrices, but, nonetheless, is useful in many of 
the cases of interest. 

Finally, we will derive an expression for P(a) in terms of a series expansion for the density matrix. 
Let us suppose that we can express the density matrix as an anti-normally ordered series 


B= È På” lâ" (4.23) 


If we again consider the expression for Á (â, â*) given by eq. (4.14) we find that 


TrGA)= DD pamcns Tr(d”(ä’) (0V â"). (4.24) 


nm =0 r,s =0 


The trace in eq. (4.24) can be expressed as 
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Tr(a” (a “ya ‘j= Tr((ä ab aie a’*™) 


so that 


Tr(6A) == | Pa S SD Pnmena*a”a*a’. (4.25) 


n,m =0 7,50 


Comparing this with eq. (4.16) we see that 


P(a)== I Imre”. Ä (4.26) 


The difficulties which we had when considering eq. (4.21) suggest that we will have similar problems 
with eq. (4.26). In fact the problem goes back to eq. (4.23). The class of operators for which a 
meaningful anti-normally ordered expansion exists is highly restricted. One can see this by considering 
the representation for an operator given by eq. (4.11). Expand D’'(£)= exp(£*ä) exp(-£ä*) exp@lé[) 
in an anti-normally ordered power series and insert it back into eq. (4. 11). This gives us an 
anti-normally-ordered power series for Ä: 


A= © dmä”(är), (4.27) 
with the coefficients given by 
1 1 Ary 2 n(¢ex\m 
dam = ni | PETHADE)) (EP EY". (4.28) 


For these coefficients to exist Tr(AD(€)) must be a very rapidly decreasing function of él. Our previous 
remarks indicate that this will not be true in general for Hilbert-Schmidt operators and, in fact, will not 
be true in general for operators of trace class (operators, A, for which Tr({A‘A]"?) < œ) such as density 
matrices. 

It should be mentioned that normally-ordered power series expansions are far better behaved. A 
derivation similar to the one above gives for the coefficients Cam in eq. (4.14) 


i | Pe THAD) eH Ee. (4.29) 


This clearly exists for a much wider class of operators than does dam. The c„m’s exist, in fact, for all 
Hilbert-Schmidt operators and the series converges in the sense that if one takes its matrix element 
between two coherent states, («| on the left and |ß) on the right, the resulting series converges to 


(al AlB). 
4.2. Symmetric ordering 


Before proceeding with a discussion of the distribution function for this case we would like to 
:onsider a few properties of the ordering scheme itself. We first note that 
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(a+ hay =D eres (Taya, (4.30) 
I=0 
which implies that 
t+ = 1 m a+\m A 
etree SY gr Ela" a. (4.31) 


lm =0 


Our operator-function correspondence is now done in a way analogous to that of the preceding section. 
Expand an operator A(d, ä*) in a symmetrically ordered power series 


A= > bam KA ar}. (4.32) 
The function corresponding to the operator is then 

A,la, a*) = > bam (a° a”. (4.33) 
Under this correspondence we see from eq. (4.30) that the function Dé) goes to 


D(é) = eisen, (4.34) 


Comparison with eq. (4.9) shows us that this is nothing other than Weyl ordering expressed in a 
different form. The distribution function, therefore, should be the Wigner function. As before we define 
this as the Fourier transform of the characteristic function x(£) (see eqs. (2.42)-(2.45)) and we use the 
real and imaginary parts of a = a, + ia; as the variables of the distribution function, so that, analogous 
to eq. (4.1), a = (2h)? (Ag + (i/A)p), where A = (mw)"?. Thus 


1 


W(a)= + | ae (9) = | | dé, dé, et) Trfp een") 


= 1f | dé, dé; Tr| 6 exp| 2ig,( a- on) - ne (a, - on) | » (4.35) 


where 


x(@)= TG DE). (4.36) 
It may be verified, using eqs. (2.42), (2.45) and (4.35) that 


W(a)= ah) Pal p)=2 | dy E E tq- yin) nlölm) (mlg + y) >" 


ze j y22 (4) (a) re" HBG + y) RB y) W dm. 
= (4.37) 
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where, in the derivation of the last line from the previous line, we have used eq. (2.90) and where 


P = (molh)? and (n|ölm) = didn. 
Examination of eqs. (4.11) and (4.34) shows us that the function A, which corresponds, by eqs. (4.32) 
and (4.33), to the operator A(d, â*) can also be represented as 


Aa, a*)== f dé TAD (E) ee". | (4.38) 
We would now like to use this to show that 

Trßä)=+ | da A,la,a*) W(a). (4.39) 
Evaluating the right-hand side we see that 

+ | Pa A,(a, a*) W(a)= = | Pa Í dE Tr(AD(€)) ef- Wla). (4.40) 


Making use of the relation 


a) = = | eer, (4.41) 
we find that 
= | Pa ee" W(a)= x(-2). (4.42) 


We also have from eqs. (4.11) and (4.36) 


„| PExCB@=< | PExOD-W=s, (4.43) 
so that 

= | Pa Ada, a*) W(a)=+ | E HABE) x(-€)= THAS), (4.44) 

T mT 


which proves eq. (4.39) and shows that A,(a, a*) and W(a) can be used to calculate the expectation 
values of symmetrically ordered operators. 

We would also like to say a word about symmetrically ordered power series. Comparison of eqs. 
(4.11) and (4.31) allows us to calculate the coefficients appearing in eq. (4.32) 


bam = E | PETAD CE E. (4.45) 


Ham 
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These coefficients, then, will exist for all operators which have the property that all moments of 
Tr(AD(£)) are finite. While this behavior is not as good as that for a normally ordered power series it is 
certainly better than that of anti-normally ordered series. 

It is also of interest to examine the behavior of W(«). First we note that 


2 eeka | Pe THT E) Dp] Ts | (4.46) 


so that y(£) is a square integrable function. As W(«) is just the Fourier transform of x(£) it too is 
square integrable. Therefore, W(a) is far better behaved than P(a@) and will exist for all density 
matrices. 

It is also possible to express the Wigner distribution in terms of the P representation. If we can 
represent the density matrix as in eq. (4.15) we then have that 


x)= [ PEPENE eE: Ip) = | cep Pes rer. (447) 
Taking the Fourier transform of x(£) gives us, with the use of en (4.35) and (4.37), 
W(a)== | ae | BB Pier er ner ee 
= if dB | dE P(B) ela-BE*-(@*-B*)E-IER2 


=2 f dB P(B) e?e., (4.48) 


4.3. Anti-normal ordering 


Let us suppose that we have an operator given by an anti-normally ordered power series as in eq. 
(4.27). The function corresponding to the A of eq. (4.27) is then 


A,(a, a*)= > dum a” (a*)”. (4.49) 
nm=0 
By analogy with our discussion of the P representation (eq. (4.26)) we can then express Å(åâ, â*) as 
A(é, â*)= 1 | da A,la,a*)|a)(al. (4.50) 


We then have that 


Tr(fA) = 1f d’a A,(a, a*)Tr(la) (al) = | da A,(a, a*) O(a), (4.51) 
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where we have set (Kano [1965]) 
1 53% 
Qla) =z (alóla). (452) 
This distribution can also be expressed in terms of a characteristic function 


xalé) = Tr eE? ef”). (4.53) 


We have that 
Xale) = | da tale“ petta) = | da e-e (alóla) (4.54) 
so that | 
Qla) = <a | Bee yale) 
=| | oe dg ere ef (glölß)= È (alla) . (4.55) 


Again by considering our derivation of the P representation we can derive an alternate expression 
for A,(a, a*). Examining eq. (4.21) we see that 


Ala, a*)= + | BE er -et Tr(A ef em FA), (4.56) 


The “function” A,(a, a*) has, of course, all of the singularity problems of the P representation. 

The distribution function, Q(a), has, on the other hand, no singularity problems at all. It exists for all 
density matrixes, is bounded, and is even greater than or equal to zero for all a. The problems in this 
ordering scheme arise in the representation of the operators. 

As a final remark, we note that all of the distribution functions can be written in terms of the Wigner 
distribution function (McKenna and Frisch [1966]; Agarwal and Wolf [1970]; Haken [1975]; O’Connell 
[1983b]), by use of integrals or derivatives. 


4.4. Examples 


We would now like to calculate Q(a) and P(a) for a single mode of the radiation field of angular 
frequency w. The system which we will consider will be a canonical ensemble at temperature 
T = (kB)*. Our discussion will follow that given in Nussenzveig [1973]. 

We first consider the anti-normal distribution function Q(a@). The density matrix for this system is 


p= (em) Se In) (nl. (4.57) 


n=0 
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For Q(a) we than have from eq. (4.52) 


Qla)=+- erm) D e (aln) (nla) 


n=0 


= 1 — a~ Bħw = —nBhw „-lal? lal?” 
= er) 2 e eTel 
= i (1- ee) exp[-le}? (1- e*-)]. (4.58) 


To obtain P(a) we make use of our result for Q(a@). We first find y,(&) from eq. (4.54). If we set 
s=(1-ee), €=xtiy, a=rtik (4.59) 


then 
xalé) = = | dia ee tra ers = = | dr | dk exp{-2i(kx - ry) — s(r?+ k?)} 
= = | dr | dk exp{—s(r— iy/s)? — s(k + ix/s)*} exp{—(x? + y?)/s} = en lePis | (4.60) 


To calculate yn(€), given by eq. (4.19), we now use the general relation 

xn(E) = Tr ee) = Tr et ef") elf er. (4.61) 
Therefore, we see that 

xn(é) = exp{-|éP(1— 5)/s}. (4.62) 


If we set A = (1- s)/s = (e° - 17? then from eq. (4.21) we have 


P(a)= | agente" ene = | dx | dy expl2i(kx - ry) -A(x + y} 


ee os ach Bho _ _t,]2 hw _ 
m „(€ 1) exp[-|a|? (e*** - 1)]. (4.63) 


For this system P(q@) is a well-behaved function, a Gaussian in fact, and has no singularities. It is 
even positive definite. O(a) is also well behaved, but this comes as no surprise. Our general discussion 
had ensured that this would be the case. 


4.5. Distribution functions on four-dimensional phase space 


We would now like to briefly discuss some distribution functions which are functions on a four- 
rather than a two-dimensional phase space. The first of these, the R representation, was discussed by 
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Glauber in his 1963 paper. It is very well behaved but has found little use in applications. More recently 
a new class of these distributions, the generalized P representations, has been used to study the photon 
statistics of various non-linear optical devices [Walls, Drummond and McNeil [1981]; Drummond and 
Gardiner [1980]; Drummond, Gardiner and Walls [1981]). 


The R representation of the density matrix is obtained by using the coherent state resolution of the 
identity twice. One has 


= | Pa | dg exp{-Hal?+ IB} Ra’, B) la) (Bl, (4.64) 


where lx) is defined in eq. (4.5) and |B) has the corresponding meaning, and 


R(a*, B)= exp{(lal? + 8P) (alp|B) . (4.65) 
This representation has no singularity problems. Also it exists and is unique for all density matrices 


provided that R(a*, B) is an analytic function of a* and B (Glauber [1963b]). It can be used to evaluate 
normally ordered products. One has 


(EY a") = Trló (4° a" |= = | da | dB expt-(a + |BP)+ B*a} R(a*, B)a™ (B°). (4.66) 


The generalized P representations (Drummond and Gardiner [1980]; Drummond, Gardiner and 
Walls [1981]) are again functions of two complex variables but are not necessarily defined for all values 
of these variables. To define these representations we define the operator 


A (a, B) = |a) (B*|KB*|a) (4.67) 


and an integration measure du (a, ß). It is the choice of this measure which determines the distribution 
function. We will consider two different choices. The density matrix is then 


b= | dula,B) Pla, B) A(a, B), (4.68) 
D 
where D is the domain of integration. Normally ordered products are then given by 


(ary (6)")= | Aula, B) Pla, B) B" a”. (4.69) 


Our first integration measure is dy (a, ß)= da dB where a and ß are to be integrated on some 
contours C and C’ respectively. This gives rise to what is called the complex P representation. Let us 
consider the case in which C and C’ are contours which enclose the origin. One can then show 
(Drummond and Gardiner [1980]) that if the density matrix is of the form 


p= >. Dd om |n) (ml, (4.70) 
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where both sums are finite then P(a, 8) exists and is analytic when neither @ nor B is 0. Whether 
P(a, B) exists for a general density matrix is not known. The complex P representation is also not 
unique; if one complex P representation exists for a given density matrix, then an infinite number of 
representation exist. 

The second measure which we wish to consider is du (a, 8) = da? dß?. Because the coherent states 
are linearly dependent such a representation is not unique. In fact we have encountered one 
representation of this type already, the R representation. It is possible to choose P(a, £) so that it is 
real and non-negative (Drummond and Gardiner [1980)), i.e. 


P(a, 8) = (1/4?) exp{—ala - B*P} Gla + P*a + B*)). (4.71) 


This representation, the positive P representation, is defined for all density matrices. 

These two distributions have been used in problems in which non-classical photon states (states 
which are more like number states than coherent states) are produced. Under these conditions the 
above defined generalized P representations are better behaved than the original P representation. For 
example, the P representation corresponding to a density matrix 6 = |n} (n| contains derivatives of delta 
functions up to order 2n. On the other hand, the complex P representation for this state (again defined 
on two contours C and C’ encircling the origin) is just (Drummond and Gardiner [1980]) 


P(a, B) = -(1/4?) n! e® (1/aB)"** (4.72) 
while the positive P representation is, from eq. (4.70) 
P(a, B) = (1/477) (1/n!) exp{-da - B*?} expt-a + B*?} (a + B*)". (4.73) 


Both of these functions are far less singular than the original P representation. 

The original motivation for the introduction of these generalized P distributions was connected with 
their practical applicability to the solution of quantum mechanical master equations (Drummond and 
Gardiner [1980]; Drummond, Gardiner and Walls [1981]). In general, using a coherent state basis, it is 
possible to develop phase-space Fokker-Planck equations that correspond to quantum master equations 
for the density operator (Haken [1970]; Louisell [1973]). From this equation observables are obtained in 
terms of moments of the P function. However, for various problems, as for example the analysis of 
recent experiments on atomic fluorescence (Kimble, Dagenais and Mandel [1978]) where we are dealing 
with non-classical photon statistics (Carmichael and Walls [1976]), the Glauber-Sudarshan P function is 
singular whereas the generalized P function discussed above is not. Also, use of the latter leads to 
Fokker-Planck equations with positive semi-definite diffusion coefficients whereas the former gives rise 
to non-positive-definite diffusion coefficients. In particular, the generalized P representations were 
applied successfully to non-linear problems in quantum optics (two-photon absorption; dispersive 
bistability; degenerate parametric amplifier) and chemical reaction theory (Drummond and Gardiner 
[1980]; Drummond, Gardiner and Walls [1981]; Walls and Milburn [1982]). On the other hand, the 
usefulness of the Wigner distribution in quantum optics has been demonstrated in a paper by Lugiato, 
Casagrande and Pizzuto [1982] who consider a system of N two-level atoms interacting with a resonant 
mode radiation field and coupled to suitable reservoirs. The presence of an external CW coherent field 
injected into the cavity is also included, which allows for the possibility of treating optical bistability 
(which occurs when a non-linear optical medium, interacting with a coherent driving field, has more 
than one stable steady state) as well as a laser with injected signal. 
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5. Conclusion 


We have given what we hope is a useful summary of some of the formalism surrounding the use of 
quantum mechanical quasiprobability distribution functions. To be of use, however, the formalism 
should either provide insight or convenient methods of calculation. In our next paper dealing with 
applications we hope to show that this particular formalism does both in that it has proven to be a tool 
of great effectiveness in many areas of physics. 
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We consider the manifestations of Bose and Fermi statistics on the original quantum distribution 
function. Initially, we consider some general symmetry properties of the density matrix. Next, we 
convert these results into distribution-function language for the case of a system of spin-O particles. 
Finally, we consider a system of spin-+ particles, for which we treat the combined effects of both 


spin and statistics. 


I. INTRODUCTION 


Our purpose here is to consider some manifestations of 
Bose and Fermi statistics on the original quantum distri- 
bution function.! In our earlier work we also used this 
distribution function to calculate quantum corrections to 
the various properties of a Boltzmann gas. The first con- 
sideration to the effects of Bose or Fermi statistics was 
given by Uhlenbeck and Gropper,? who calculated the 
equation of state of both a Bose and a Fermi nonideal gas. 
Explicit spin effects were ignored in these considerations, 
as they were also in the later work of Kirkwood.’ 

After a lapse of nearly 20 years, the subject was con- 
sidered anew by Green,’ who wrote a relation between the 
density matrix determined on the basis of classical statis- 
tics and the corresponding density matrices for particles 
satisfying Bose and Fermi statistics. This relation made 
use of symmetrization operators, which in turn were ex- 
pressed as the product of a number of cyclic operators. 
However, we wish to establish more explicit expressions 
than are given by Green’s equation. 

In their consideration of the quantum theory of trans- 
port in gases, Ross and Kirkwood? wrote down the sym- 
metrized pair distribution function in terms of singlet dis- 
tribution functions by making use of symmetry operators. 
The latter were discussed in detail in Appendix B of their 
paper and they concluded that the symmetry operation on 
the original quantum distribution function is represented 
by an integral operator. A different approach to the sub- 
ject was introduced by Schram and Nijboer,® who intro- 
duced the symmetry requirements by means of a restricted 
summation of states in Hilbert space. Finally, these au- 
thors were able to express the symmetrized distribution 
functions in terms of an integral involving the original un- 
symmetrized distribution function, permutation operators, 
and various 6 functions. They then went on to express the 
partition function as a summation involving permutation 
operators—but did not obtain a very explicit result. 

A particularly important contribution to the subject 
was the article of Stratonovich.’ He recorded the distri- 
bution function for particles of arbitrary spin and intro- 
duced the concept of second quantization in the phase 
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space by essentially replacing the wave functions appear- 
ing in the distribution function by operators. The 
second-quantization approach has been considered further 
by other authors and used as a method for the incorpora- 
tion of Fermi and Bose statistics.°~ 1° 

In a recent series of papers dealing with a one- 
component plasma (also called “jellium”—a system of 
identical particles embedded in a uniform neutralizing 
background of opposite charge), Jancovici!® and Alastuey 
and Jancovici!"!? calculated exchange quantum correc- 
tions in the near-classical limit. They found, both for 
three-dimensional!®!! and so-called two-dimensional sys- 
tems,'' that the exchange free energy is negligibly 
small—a conclusion not changed by the presence of a 
strong magnetic field. !? 

In this paper we wish to consider anew the general sui- 
ject of the manifestations of Bose and Fermi statistics on 
the quantum distribution function. In Sec. II we consider 
some general properties of the density matrix describing a 
mixture of several states. Then, in Sec. III we apply these 
results to a calculation of the effect of statistics on a sys- 
tem of spin-O particles. Finally, in Sec. IV we extend the 
description to a system of spin-+ particles, for which we 
treat the combined effects of both spin and statistics. 
Apart from the obvious potential application of our re- 
sults to a calculation of the equation of state of Bose and 
Fermi gases, we also consider that they should prove use- 
ful in the analysis of the spin-spin correlation experi- 
ments’? which are of interest in connection with tests of 
the Bell inequalities. '* 


II. DENSITY MATRIX FOR A MIXTURE 
OF STATES OF PARTICLES SUBJECT TO BOSE 
OR FERMI STATISTICS 


Consider a system of n identical particles (1,2,..., 
i,...,m) and let x; and x; denote all the coordinates of 
the ith particle, including its spin. Our starting point is 
the density matrix p for a system of identical particles. 
This represents, by definition, a system in which a variety 
of wave functions %, is present with probabilities w,. We 
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have, therefore, by the definition of p, 


PAD wy Whey X25 RR) 
E 

SOX Xap beg ER s3 Xise). (1) 
No matter to which permutation we subject both the x’ 
and the x, the value of p will remain unchanged. This is 
evident if the particles are of Bose type—in this case both 
% factors remain unchanged. It is also true if the particles 
are of Fermi type since in this case either both % factors 
remain unchanged (if the permutation is “even”) or both 
% factors are multiplied by —1 (if the permutation is 
“odd”), so that p remains the same in this case also. It 
will be assumed, therefore, that all p, representing only 
identical particles, are unchanged if both sets of 
variables—the x’ and the x of Eq. (1)—are subject to the 
same permutations. Naturally, if the p refers to a system 
of several types of particles, this remark is valid only for 
permutations which interchange only variables referring 
to identical particles. 

If only the so-called “column variables”—those after 
the semicolon (;) in the argument of p—are interchanged, 


ee a Ee E: 


' , D 7 
PUR aK td aR RR pa) 


, ’ [2 
= P(X], X2... Xis.. 
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p will remain unchanged if the particles are of Bose type 
or if the permutation is even. In these cases all the Y, of 
Eq. (1) remain unchanged. If the particles are of Fermi 
type and if the permutation is odd, the sign of p will be 
changed. The same applies, of course, for the so-called 
“row variables” of p—those before the semicolon (;) in the 
argument of p. 

We will demonstrate now that if p is symmetric in the 
sense previously specified, i.e., invariant under the simul- 
taneous and identical interchange of both row and column 
variables, and if it has the right symmetry property with 
respect to the interchange of one single pair of two 
column variables (or any pair of two row variables), it 
then has the right symmetry property with respect to any 
interchange of variables. 

What will be proved actually is that if p is invariant 
with respect to any simultaneous and identical permuta- 
tion of both row and column variables (which was demon- 
strated for Bose- and Fermi-particle density matrices) and 
if, in addition, it is symmetric or antisymmetric with 
respect to the interchange of the first two column vari- 
ables, 


Kerr GH ip ee ey EE Brecon) ys (2a) 


then it has the same symmetry or antisymmetry with respect to the interchange of any pair of column variables, in par- 
ticular the interchange of the ith and jth column variables, so that 


D ' d ; 
PAN HDs oe A aa Ajr Ape Aaj a) 


' ’ G 
= tplxX1,%X2,.-..,%s--- 
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It follows then from the possibility of arranging any permutation of the x’s by a succession of the interchange of two of 
them that any even permutation of the column indices will leave the density matrix unchanged and any odd permutation 
will do so also for the case of Bose particles and change the sign for fermions. It further follows from the self-adjoint 
nature of p, that is, p(x’;x)=[p(x;x’)]*, that the same is true for the row variables. Hence the symmetry of p with 
respect to any identical interchange of row and column variables, plus the validity of Eq. (2a), establishes the fact that p 
represents a set of identical bosons or fermions, depending on whether Eq. (2a) is valid with the + or — sign. This 
theorem facilitates the establishment of Bose or Fermi statistics in density matrices and hence also in distribution func- 
tions derived from them. 

In order to derive Eq. (2b) from Eq. (2a) we first notice that the interchange of both the x and the x’ sets of variables 
at positions 1 and i and at positions 2 and j leaves p unchanged—as a result of the symmetry requirement. Hence 


PAR i ois ie BER ap Sergey ess) 

pl RR peers kn a GC) 
We now interchange the first two column variables, which gives, according to the assumptions made, 
PAKISTAN Kee) 

FEN RR [yao aa X gece RK) (4 


If we now interchange again, in both row and column, the first and ith variables and also the second and jth variables, 
the desired equation (2b) is obtained. This shows that if Eq. (2a) is valid—or a similar equation for another pair—and if 
p is invariant under the identical permutation of both row and column indices, then the density matrix takes care of the 
Bose or Fermi requirements of the state represented by it—Bose or Fermi depending on whether Eq. (2a) witha + or — 
is valid. This will facilitate the derivation of the proper conditions for the distribution functions. Finally, we repeat that 
if there are several systems of identical particles present, then Eq. (2b) is valid for each set of coordinates referring to the 
same type of particles. 
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III. SPIN-O PARTICLES 


It should be observed that in the preceding argument the x’ and the x can stand for all coordinates of a particle, includ- 
ing its spin. However, in the translation of Eq. (2a) into the language of the distribution function, which follows, we dis- 
tinguish between position and spin coordinates. The effect of the spin can be treated separately. 

We now consider the corresponding distribution functions P(q1,p1,.--54Qj:Pis - - + s 9n»Pn), Which are functions of posi- 
tion and momentum coordinates qj, -s qis -< >, qn and Pis -:.,Pi -: . ‚Pn, respectively. In the classical limit, P(g,p) is 
the phase-space distribution function which gives the probability that the coordinates and momenta have the values g 
and p. Specifically,! 

P(q1P1---»4i>Pis +++ »An»Pn) 
(ae f bees f dy, +++ dyi dy, 
Xp(qi—Y1, 3,4 Vi -> ->n Ins FI -+ -gito >In tYn) 
Xexp[2i(piyit tt +Pyit t +Pa¥n)/A) - (5) 


In Eq. (5), 9;, pj, and y; are considered to be three-dimensional vectors, py is the scalar product of p and y, and dy, 
means integration over all three vector components. All integrations in this paper are from — œ to oo. 
From Eq. (5) it follows that | 


J fdr > days +> dpa Plopi -> GisBis «+ + gm Puder p[—2ilpıyı+ HPI +P adn) A] 
= (G1 Vis + + Gi Vin oo In ~Y FV as -eo tYin -eean tn) (6) 
Hence, changing variables to u; (=9;—y,;) and v; (=q,+y,)i=1,2,...,n, we obtain 
PU ay oo Uys oo ny Uppi oy Vj eee Vp) 
=f oes f dp,:::dp;** dp, P(-4(u,+01),P1, oeeo Flui HY) Pi <- un +a) Dn) 
xexp{ +i[pilu;—v1)+ +++ +pi(u;—vi)+ +++ + py (ug —tn)]/A} . (7) 


At this stage we note that since Eq. (3) refers only to two of the particles, we number these 1 and 2 and, in fact, omit 
the coordinates of the other particles from the equations in order to make them more simple. We will therefore express 
the equation [see Eqs. (2a) and (2b); we omit the + sign since we are dealing here with spin-O particles] 


PCG 11925391192) = (419259291) » (8) 
in which the q stands for all three space coordinates, in terms of the corresponding distribution functions. We then have 
[see Eq. (5)] | 

P(q1,P1,92,P2)=(rA)”* f f plgı-Y1,92—-Y2591 491,92 +y92)expl2i(pıyı +PaY2)/Alayı dyz (9) 
and also [see Eq. (7)] 

px .x2sxixad= | f dpi dp, Pl4(x,+x1),p1,$1x2+x2),p2explip (x4 —x1)/Fi+ ipa(x5 —x2)/A) . (10) 


The condition (8) can therefore be written by equating the right side of (10) with the same expression in which, however, 
x, and x, are interchanged. 
In order to simplify the resulting equation, we introduce new variables instead of the x and the p: 


q(x +x1 +x2+x3)=9, F(x —%1 +x2—x3)=Y, 
| (11) 
q(x 4x4 -92-x3)=q, (xi +x] $42 —%9) = G2 
and 
zlpıtp.)=p, +(pi—P2)=p' - (12) 
Since the interchange of x, and x, is represented by the interchange of q; and q2, this leads to the equation, instead of 
(2aXi =1, j =2), 


4ilp'g,—p/A 4i(p'a, —pN/A 


J Í dp dp' Pia +91,P +p',g -—qu,p -p'e =f f dpdp' Pig +42,P +p',g —42,p —p')e 
(13) 


Since Y appears only in the exponent, the integration with respect to p can be eliminated, leading to (after replacing the 
dummy variable p’ by p;) 


Bose and Fermi Statistics on the Quantum Distribution Function 321 


2616 R. F. OCONNELL AND E. P. WIGNER 30 


4i /A Ai /h 
J dp, Pia +91,P +P1.9 -gip -pıle = f dpi Pig +92,p +719 —92p —Pile PS, (14a) 


It must be admitted that this equation, for the distribution function, postulating the Bose statistics for a system of spin-O 
particles, is much more complicated than the corresponding equation (2a) for the density matrix. 
The essential equivalence of the position and momentum variables in this equation can be demonstrated by multiplying 
it with exp[ —4i (p'qı +p"q2)/ñ] and integrating over q; and q2. The resulting equation is 
” ” —4ip'g, Mi n ; l ’ —4ip"g,/f 
J daı Pig +a1p +p" -gop pe = f dao Pla +q2,p +P'q aop pie S, (14b) 


in the right-hand side of which, naturally, q3 can be replaced by g;. The equation then becomes the analog of Eq. (14a) 
with the roles of p and q interchanged, but the signs in the exponentials reversed. Equally easy an equation can be ob- 
tained in which g and p play essentially the same roles, but the two sides of the equation are quite different. 

This is achieved by multiplying Eq. (14a) with e W/M and integrating over qı. On the right side this gives a factor 
(wR/2)’8(pı —p2) (we must not forget that the q1p, in the exponent is a three-dimensional scalar product), hence one ob- 
tains 


2 . 4i(P 199 —9,P2)/% 
P(g +42,P +P29 —92»P —P2)= 2] J [ Pia +aup +p gip pide" a dp, day , (15) 


which is essentially symmetric in position and momentum coordinates. Naturally, by another Fourier transformation, 
this can be transformed into the analog of Eq. (14a), with the position- and momentum-coordinate roles interchanged. 
As a check, we use Eq. (15) to replace the function P on the right side of Eq. (15), obtaining 


P(g +@2,P +P29 —92;P —P2). , 
=(2/rň f f f f Pia+a2.p+p29 -aip pi) 
x exp[4i (g2p1 —4ıP2)/+4i(qıP2 —42P1)/Ä]dq, dp, dgı dp; . (16) 


Carrying out the integrations over q; and p, we obtain factors (mA/2)’d&(p, —p}) and (mA/2)’ö(g, —g}), as a result of 
which we verify that the right side of Eq. (16) reduces to the left side, proving the self-consistency of Eq. (15). 


IV. SYSTEM OF SPIN-+ PARTICLES 


In the case of a pure state ¥(q) we extend our previous definition of the single-particle quantum distribution function 
as follows: 


Pigp =h E ohm f [Wg +y,m) Wa —y,m')e2 Ady (17a) 
m,m'=1,—1 
in the case of a pure state and 
Piap)=(M E hm f plga—yım';q +y,m)e dy (17b) 
m,m'=],—1 


in the case of a mixture of states, where « takes on the values 0, x, y, and z; the o’ is the unit matrix and the others are 
the Pauli matrices for the spin, the rows and columns being labeled 1 and — 1. Specifically, 


o |10 0 -i 
ee ee, 
(18) 
01 1 0 
= 11 op 0-1 
If Eq. (17a) is integrated over momentum p, one obtains 
J Plq.p,0)dp = | W(g,1)|?-+ | Wg, —1) |?=plg. liq 1)+p(g,—139,—-1) , (19a) 
J Plap.xddp = {ig DI Wg, — +1 0g, — DI Wa, 1} , (19b) 
J Plap.yidp =i (iMa, 1g, — 1) ita, — DP Wg D} , (19c) 
[ Plqp,z)dp = 1%, |?— | Wg —1)1?. (19d) 


Thus Eq. (19a) gives the total probability of finding the particle at position q, irrespective of its spin state, whereas Eqs. 
(19b)—(19d) give the difference in the probabilities of finding the particle with spin up and spin down, referred to the x, 
y, and z directions, respectively—always at g. Hence, the normalization of P, though it involves integration over q and 
P, restricts the value of x, actually to zero. 
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In the case of a two-particle system, Eq. ee to 


P(g,PuKg92Pak)=A) SL > On ule ne mi, J f (qi +yom;g +y) 


mm} mm; 


X Wg —Y1,7 15492 —Y2,m2 )exp[2i (pıyı +Pıy2)/Aldy, dy. 


(20) 
in the case of a pure state and to 
P(q1,P1>K1592»P2K2) 
=A E E onm Imm, J | dri dys expl2i(piys +P 22/4] 
mmi M3, m; 
xXplgı -yu,m 1,92 -Yam25gı +91, M1,92+Y2»M2) » (21) 


in the case of a mixture of states. They follow from the fact that the 4x4 matrix M emm =2" om is unitary. One 
has, therefore, 


> om Ohm)" = > Om’ Oh'm = 26, . (22) 


m,m’ m,m' 


The second expression is equal to the first because all o* are self-adjoint. Similarly, we have 
> Omm Onn) = > Omm’On'n =25 mn Sm'n’ . (23) 
K K i 


It then follows from Eq. (17b) that 
plx'n';x,n)=3 Donn f dp P(4(x +x’),p,n)eP x") 4a) 
K 


or, with Eq. (10), for the two-particle situation, 


p(x},mı,x3,m3;x,mı,x,m.)=7 > o p J S a1 dp PG (xy +24 ,PuK2(%2+%2),P2,K2) 


mim, 
Ky Ke 


Xexp[ip (x —x1)/Ä+ip2:(x3 —x2)/ň] , (24b) 


from whence it follows that the generalization of Eq. (14a), to the case of spin-+ particles, is 


>> Xe ym, f Pia tanp +PıKg 9, P —Prsky)e P192/9 dp, 


Ki K 


Mh 
=- >> an Trim, f P(g +42,P +P1.K139 —92,P —Pı,Ka)e “Pi dp,, (25) 
KI Ky . 
with the minus sign arising from the fact that we are dealing with fermions. This equation will be simplified by express- 
ing the product of the two o matrices on the left side, 


[o(kı,K.)] =o), 0: 26 
2) Im mg emmy =F mim mim; ’ ni 


by the product of the o matrices on the right side, 


Ki k, 

[rtkur2)],. m} ;mım, =O mim, I mim, : (26b) 
We can then write 

T(K},K2)= > B (Kik didol A) ’ (27a) 


AyAy 


and since the a(,,K2) as functions of the m are linearly independent of each other (in fact orthogonal), we can write, in- 
stead of Eq. (25), 
4ip 9, /% 
f P(q +41>P +P1,K139 —Q1sP —p1,K2)e 192 dp, 
== 2 BOA) J Pig +92,P +P1Aisg —92.P —Pird2 oP dp,. (27b) 
Ayda 
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Equation (27b) is the basic result sought, being a conse- rt x,x)=+[o(x,x)—o(y,y)—o(z,z)] + +0(0,0) 
quence of the antisymmetry of the wave function for 
spin-+ particles. The matrix B is much simplified by the =o(x,x)— zo(f,7)+70(0,0). (28h) 


fact that the rotational transformation properties of the o 
and 7 matrices are the same—they represent either scalars 
or vectors or tensors. Hence, for example, the matrix ele- 
ment of B which connects the scalar component of 
0(A,,A2) with the vector components of r(x,,«,) vanishes. 
It may be stated additionally, that B is unitary and its 
square is the unit matrix so that its characteristic values 
are all 1 or —1. It is not difficult to calculate B—its ele- 
ments are, of course, the same for the different com- 
ponent parts of the vector (tensor) components of o(A,,A2) 
which are connected to the vector (tensor) components of 
T(K DK2 ). 

As can be easily verified, the explicit form of the rela- 
tion (27a) is 


7(0,0)= +a(0,0) + +[0(x,x)+o(y,y) +o(z,z)] 


= +[0(0,0)+0(7,7)]; (28a) 
AT, P=r(x,x)+7y,y)+7(z,z) 
= 20(0,0) _ +[01x,x) +o(y,y) +o(z,z)] ; (28b) 
xy) +7(y,x)=a(x,y)+oly,x);T(y,z) +7(z,y) 
=0(y,z)+a(z,y),...3 (28c) 


r(0,x)+7(x,0)=0(0,x)+0(x,0),...: , r(0,2)+r(2,0) 

=0(0,z)+0(z,0) ; (28d) 
(0, x)—-r(x,0)=—i[o(y,z)-o(z,y)],...; (28e) 
1 y,z)—7(z,y) =i [o(0,x)—o(x,0)],...5 (28f) 
2r(x,x)—r(y,y)— r(2,2) 


=20(x,x)—oaly,y)—o(z,z),...3 (28g) 
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Clearly, these equations are not all independent of each 
other—Egs. (28b) and (28c) follow from Eq. (28h) and 
show only the effects of the invariances more explicitly. 

Evidently, the consequences of the wave functions an- 
tisymmetry for spin-+ particles are much more compli- 
cated than for the density matrix even though the preced- 
ing equations could be given a more simple form. 
Nevertheless, it should not be truly difficult to give Eq. 
(27b)—the basic equation—a more explicit form and also 
a more simple one by introducing, instead of the indices x 
and A the indices referring to the left sides of the equa- 
tions (28). Another possibility is to accept the matrix B as 
a basic quantity. One way this could be done would be to 
decompose B into three parts: those referring to scalar, 
vector, and irreducible-tensor expressions in terms of the r 
appearing in Eq. (28). The first part of B would be two 
dimensional, the second part would contain three separate 
three-dimensional matrices [referring to 7(0,x), 7(x,0), 
t(y,z)—7(z,y), and the other two similar triplets], and the 
last one would be a five-dimensional unit matrix referring 
to the five components of the five-dimensional representa- 
tion of the rotation group [cf. Eqs. (28c) and (28g)]. But 
even if this is done, it must be admitted that the equations 
will be more complicated than the equations postulating 
Fermi statistics for the density matrix. 
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It is shown that, in the phase-space representation of quantum mechanics, the uncertainty rela- 
tion can be stated in terms of the integral invariant of Poincaré. The uncertainty relation for 
spreading free wave packets is discussed as an illustrative example. This phase-space approach can 
be extended to the relativistic regime. It is shown that Lorentz boosts are area-preserving canonical 
transformations in the phase space of the light-cone variables. The harmonic oscillator is discussed 
in detail as an illustrative example for the covariant realization of the uncertainty principle. 


I. INTRODUCTION 


The phase-space representation of quantum mechan- 
ics! is of current interest. We have shown in our previous 
paper’ that the light-cone coordinate system is the natu- 
ral language for the Lorentz-covariant phase-space repre- 
sentation of quantum mechanics. The localized light 
wave was discussed as an illustrative example. However, 
the light wave depends on only one of the two light-cone 
variables. 

The purpose of this paper is to discuss a physical exam- 
ple depending on both of the light-cone variables. The 
covariant harmonic oscillator will serve this purpose. 
The harmonic oscillator occupies a very prominent place 
in the physics of phase space. Unless we know how to 
deal with the covariance of the harmonic oscillator, we 
are not likely to understand the covariance of phase 
space. 

In the phase-space representation, the uncertainty rela- 
tion is stated in terms of the integral invariant of Poin- 
care,’ which is called the “error box” in the current 
literature. The area of the error box cannot be smaller 
than Planck’s constant. In order to illustrate the advan- 
tage of using the phase-space representation, we start 
with the problem of wave-packet spreads. In the 
Schrodinger picture, the uncertainty product increases as 
time progresses or regresses. On the other hand, the 
volume of the error box remains constant for the spread- 
ing wave packet, even though its shape is deformed.° 
This means that the wave-packet spread is a canonical 
transformation in phase space. 

The phase-space representation of nonrelativistic quan- 
tum mechanics has a built-in symmetry which is 
mathematically equivalent to that of the (2+1)- 
dimensional Lorentz group. Since the position and 
momentum variables are c numbers in the phase-space 
representation, it is possible to formulate canonical trans- 
formations in a manner identical to the case in classical 
mechanics. Indeed, the group of linear canonical trans- 
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formations is Sp(2) which is isomorphic to the (2+ 1)- 
dimensional Lorentz group.” 

In this paper, we shall show that Lorentz boosts are 
canonical transformations in the phase space consisting 
of the light-cone variables. Thus, from the mathematical 
point of view, the Lorentz covariance does not add new 
complications in phase space. However, from the physi- 
cal point of view, we are dealing with a realization of the 
uncertainty principle which is different from that in the 
Schrodinger picture of quantum mechanics. The phase- 
space representation enables us to state the uncertainty 
relation in a Lorentz-covariant manner. 

In Sec. II, we study the phase-space representation for 
wave-packet spreads and compare it with the case of har- 
monic oscillators. It is pointed out that the uncertainty 
relation can be stated in terms of the area of the error box 
in phase space. The area of the error box remains un- 
changed as time progresses or regresses. It is shown that 
the error boxes for the spreading wave packet and the 
harmonic oscillator coincide with each other in the 
large-time and weak-spring-constant limits, respectively. 
In Sec. III, the wave-packet spread is formulated in terms 
of homogeneous linear canonical transformations in 
phase space. 

Sections IV and V consist mostly of reviews which are 
needed for the covariant formulation given in Sec. VI. 
Section IV deals with the phase-space representation for 
nonrelativistic harmonic oscillators. Section V is based 
on the covariant harmonic-oscillator formalism which 
constitutes a representation of the Poincaré group, and 
which effectively describes the basic phenomena of rela- 
tivistic hadrons in the quark model. 

In Sec. VI, we combine Secs. IV and V to formulate the 
covariant phase-space representation of harmonic oscilla- 
tors. The conclusion of the present paper is that the con- 
cept of the covariant error box in phase space gives the 
physical basis for the phenomenology based on the 
mathematics of the covariant harmonic-oscillator formal- 
ism. 
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II. WAVE-PACKET SPREADS 


Let us start with a one-dimensional harmonic oscillator 
in its ground state. The uncertainty product remains in- 
variant when the spring constant is gradually reduced. 
On the other hand, when the oscillator force is suddenly 
removed, the uncertainty product becomes dependent on 
time. Although these two different cases are two different 
manifestations of one physical principle, the wave-packet 
spread has been one of the agonizing features of the 
present form of nonrelativistic quantum mechanics. 

In the phase-space representation of quantum mechan- 
ics, the uncertainty relation can be stated in a time- 
independent manner for both the harmonic oscillator and 
the spreading wave packet. If (x,t) is the physical solu- 
tion of the Schrödinger equation, the phase-space distri- 
bution function is defined as’ 


BEE eee ee ae a er eee ee | 


2 ZB: > 
z l ~P D= 2 Iran È 


where m is the mass of the particle, and V(x) is the po- 
tential. In the case of the harmonic oscillator with 
V(x)=Kx?/2, the above differential equation becomes 


2 Pixp n= = a P pe Kx Pix.) : 
(2.4) 
If the particle is free, this differential equation becomes 
È Pixp n=- 2 Z piap . (2.5) 
The solution of the above differential equation is? 
P(x,p,t)=P(x —pt/m,p,0). (2.6) 


The time evolution of this solution is illustrated in Fig. 1. 
Indeed, the error box undergoes a shear. The volume of 
the error box is invariant under time evolution. This is a 
more precise statement of the uncertainty relation than is 
given in the Schrödinger picture. 

If we start with a free-particle wave packet with a 
Gaussian momentum wave function 
1/4 


b _pk? 
er 


g(k)= (2.7) 


at t=O, the time-dependent Schrodinger wave function 
becomes 


1/2 
e —x?/Ub+it/m) . 


(x)= (2.8) 


1/4 
b S SN 
T b +it/m 
If we construct the PSD function for the above wave 
function, its form is 


2n 
1 1 
2 (2n +1)! 
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Plxpt= f U(x +y, plx —y,tePdx , (2.1) 


which we shall hereafter call the PSD function. This is a 
function of t, x, and p, which are c numbers. This func- 
tion is real but is not necessarily positive everywhere in 
the phase space of x and p. We can, however, recover the 
positive distribution functions in the position and 
momentum coordinates 


p(x,t)= | Wx,0)|2— f P(x,p,t)dp , 


(2.2) 
o(p,t)= | ¢(p,t)|?= f P(x,p,tidx , 


where d(p) is the momentum wave function. 
The time-dependent Schrodinger equation leads to the 
differential equation! 


2n +1 


ð V(x) 


ox (2.3) 


2n +1 
ax | P(x,p,t) , 


maaan 
Pix,p,t)= exp] —[(x —pt/m}?/b +bp?°)} . (2.9) 


This distribution is concentrated within the region where 
the exponent is less than 1 in magnitude. This region is 
described by the tilted ellipse described in Fig. 2. This is 
the error box for the spreading wave packet. 

Since x and p are c numbers in the phase-space repre- 
sentation, the PSD function P(x,p,t) can be canonically 
transformed in phase space, as is done in classical 
mechanics. The concept of the error box is already in the 
Poisson-bracket formalism of classical mechanics. Its 
volume is invariant under canonical transformations. 
This is called the integral invariant of Poincaré. Howev- 
er, classical mechanics does not give the lower limit on 
the size of the error box. 

Let us go back to the wave-packet spread. How is the 
spreading Gaussian wave packet different from the 
ground-state harmonic oscillator whose mass and spring 
constant become adiabatically weak? This is a situation 


FIG. 1. Shear in phase space. Every point in phase space 
moves horizontally in the x direction with velocity proportional 
top. This is an area-preserving transformation. 
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very familiar to us, and is described in Fig. 2, in which 2 0 0 in 

the x axis expands and the p axis contracts. This defor- AN: EN a 3.3) 
mation is also an area-preserving canonical transforma- i 0 —i/2p "2 |i/⁄2 0f’ ` 


tion, and is commonly called “squeeze” in the literature.’ 

The spread of the Gaussian wave packet is also illus- 
trated in Fig. 2. This is consistent with the shear effect 
given in Fig. 1. It is possible to take the “projection” of 
this elliptic distribution to the x and p spaces using the 
formulas for p(x) and o(p) given in Eq. (2.2). These 
probability densities lead to the uncertainty product in 
the Schrödinger picture, which expands as time 
progresses or regresses. 

The error box for the spreading wave packet in the 
infinite-time limit coincides with the error box in the lim- 
it of zero spring constant. In the phase-space representa- 
tion, the magnitude of uncertainty is the same for both 
cases. In the Schrödinger representation, one is finite 
while the other is infinite. 


II. WAVE-PACKET SPREADS IN TERMS 
OF CANONICAL TRANSFORMATIONS 


In order to study canonical transformation properties 
of the shear described in Fig. 1, let us write the solution 
of Eq. (2.5) given in Eq. (2.6) as 


1 t/m 
0 1 


’ 


x x 
r 


P P 
On the other hand, the group of homogeneous linear 
canonical transformations is Sp(2),*6 which is locally iso- 
morphic to the (2+ 1)-dimensional Lorentz group.’ 
Indeed, the group of homogeneous linear canonical trans- 


formations consists of rotations around the origin gen- 
erated by 


(3.1) 


0 —i/2 


I=|ın 0 


, (3.2) 


and squeezes® along the x axis and along the x =p line 
generated by 


Sheor 


x 
Nee Squeeze 


FIG. 2. Ground-state harmonic oscillator and the spread of 
the Gaussian wave packet in phase space. In terms of the in- 
tegral invariant of Poincaré, the uncertainty relation can be 
stated in the same manner for both cases. Their projections to 
the x and p axes are different. In the oscillator case, the p distri- 
bution contracts while the x distribution expands. On the other 
hand, in the case of spreading wave packets, the p distribution 
does not change. This is why the uncertainty product for 
spreading wave packets increases as time progresses or 
regresses. 


respectively. These generators satisfy the commutation 
relations 


[B,,B,)=-—iL, [B,,L]=—iB,, [B,,L]=iB, . (3.4) 


This set of commutation relations is identical to the Lie 
algebra for the (2 + 1)-dimensional Lorentz group.” On 
the other hand, the shear transformation of Eq. (2.9) is 
generated by the matrix 


0 i 
= [0 0l? (3.5) 
with 
e-ittsmN_ 1 i 


Then, where does the matrix N stand among the genera- 
tors of canonical transformations given in Eqs. (3.2) and 
(3.3)? 

In order to answer this question, let us construct first 
the squeeze operator 


exp(n/2) 0 
0 exp —n/2) 


—inBy 


S(q)=e = . (3.6) 


This operator expands the x axis while contracting p. 
This is of course a canonical transformation. Thus the 
following operators also generate homogeneous linear 
canonical transformations: 


B,=B,=S(7)B,S(—7), B3(y)=e~"S(y)B,S(—7n), 
(3.7) 


L'(n)=e~"S(n)LS(—7) . (3.8) 


In the limit of large 7, the above operators become B,, 
N, and N, respectively. These operators do not form a 
closed Lie algebra of any group, but give a partial view of 
a more complete group-theoretical picture that both the 
Lorentz-boosted O(3) and O(2,1) become a group locally 
isomorphic to the two-dimensional Euclidean group in 
the large-n limit.”!° 

In the present case, the O(3)-like group is generated by 
L, L,, and L,, where 


+ 0 
0-4 


05 


L,= use (3.9) 
1 2= 110 


The O(2,1)-like group is generated by L,, B,, and B}, 
where 
0 7 


(3.10) 
-10 


B,= 


The mathematics of this process is called the contraction 
of O(3) and O(2,1) to E(2).”?"° The physical content of 
this process is that a unified description can be given for 
the O(3)-like, E(2)-like, and O(2,1)-like internal space- 
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time symmetries of massive, massless, and imaginary- H=4 P?+x?). (4.1) 


mass particles, respectively, as Einsteins E =(m? 
+p?)!/” unifies the energy-momentum relation for all rel- 
ativistic particles. This point has been discussed in the 
literature,'°!! and is illustrated in the first and second 
rows of Fig. 3. 


IV. HARMONIC OSCILLATORS 


The Hamiltonian for the one-dimensional nonrelativis- 
tic harmonic oscillator with unit frequency can be written 
as 


a Se a Se ee 


P,(x,p)= 


n! 
T 


k=0 


where r?=(x?+p?). 

The above form is defined in the two-dimensional 
phase space spanned by x and p axes. Since it depends on 
x and p only through the variable r, the function is invari- 
ant under rotations around the origin. We can thus write 
P,,(x,p) as 


P,(x,p)=P,(r) . (4.5) 


As is expected, this function is positive in some regions 
and is negative in other regions in phase space. It van- 
ishes on the circles on which the polynomial contained in 
P,,(r) of Eq. (4.5) is zero. 

In order to study this more systematically for the har- 
monic oscillator, let us see whether P,,(x,p) of Eq. (3.5) 


Mossiess 
Fost 


Mossive 


Slow — between — 


Spin, Gouge 


: —— Little Groups ~ 
Helicity 


Gouge Trons. 


Covariant 
Phose Space 


Quarks 
Partons 


)— Parton Model 


Quark Model —( 


FIG. 3. Slow and fast particles. Einstein’s E=(P?+m?)!”? 
unifies the energy-momentum relations for massive (nonrela- 
tivistic) particles and for massless particles. The second row in- 
dicates that the little group of the Poincaré group unifies the 
internal space-time symmetries of massive and massless parti- 
cles, as is discussed in Ref. 11. The third row states that the co- 
variant phase-space representation forms the physical basis for 
the covariant harmonic-oscillator formalism which has been 
shown to give a unified picture of quark model and the parton 
picture at the phenomenological level. This point is discussed 
in Sec. VI of the present paper. 


[exp( —r*)] S (- Ka" Ar -0A [in —k NP KY , 


The normalized solutions of the Schrödinger equation are 
Y (x)=[1/(V72"n1)]! 2H, (xexp(—x2/2) , (4.2) 


where H,(x) is the Hermite polynomial of nth order. 
These wave functions are in the energy eigenstates. It is 
possible to evaluate the quantum PSD function 


P (xp) = f v(x ty pleimay. (4.3) 


The result of the calculation is!? 


(4.4) 


ss 
can be interpreted in terms of another equation. The 
PSD function given in Eq. (3.5) indeed satisfies the 
differential equation”? 


a |_d 
dp |dp 


1 


F P (r) | |+4p?P,(r)=(2n + DP, (r) , 


(4.6) 


where p=V2r. This is the radial part of the rotation- 
invariant Schrödinger equation for the harmonic oscilla- 
tor in two-dimensional space spanned by the variables 
V2x and V2p. If we use R,(p) for the normalized radial 
equation for the kth excited state with the eigenvalue 
(k +1) with the orthogonality relation 


JI PR PRm iPd p= Sam l (4.7) 
then the PSD function is 

P,(n=(1/V4mR(p). (4.8) 
Therefore P, satisfies the orthogonality relation 

J PS (x,p)P,(x,p)dx dp =2n f P(r)P„(rIr dr 

=m - (4.9) 
For the ground state, the PSD function is 

Pulx,p)= Zexpl —(x?+p?)} . (4.10) 
Ifn =1, 

P,(x,p)= Z (x?-+p?—Lexpl —(x?+p?)] . (4.11) 


We can then use the Schmidt orthogonalization pro- 
cedure to construct the PSD function for higher values of 
n. 

In this paper, we are interested in homogeneous linear 
canonical transformations in phase space consisting of ro- 
tations and squeezes along a given direction. The above 
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PSD functions are rotationally invariant. The squeeze 
along the x direction means that the coordinate variable 
x is multiplied by a real positive number b, while the p 
variable is divided by b. The integral measure (dx dp) 
remains invariant during this process. 

In this.case, the group of homogeneous linear canoni- 
cal transformations is generated by 


i ð ð 
ne ap ax i 
i ð ð i ð ð 
2 | tr |, u. 
Bima ax Bl ap Pax oe) 


which satisfy the commutation relations for the genera- 
tors of the (2 + 1)-dimensional Lorentz group given in 
Eq. (3.4). While this group is one of the fundamental 
symmetry groups in the physics of phase space,® we shall 
see in Sec. VI that the Lorentz boost is a squeeze in the 
phase space of the light-cone coordinate position and 
momentum variables. 


2 


where x, and x, are space-time coordinates for the first 
and second quarks, respectively. This partial differential 
equation has many different solutions depending on the 
choice of variables and boundary conditions. 

In order to simplify the above differential equation, let 
us introduce new coordinate variables!” 


X =(x,+x,)/2, x =(x,—x,)/2V2. (5.2) 


The four-vector X specifies where the hadron is located in 
space-time, while the variable x measures the space-time 
separation between the quarks. In terms of these vari- 
ables, Eq. (5.1) can be written as 


3 02 


2 
_ X 
ax? ; 


— (X,x)=0. 
əx? P 


(5.3) 


2,1 
Mo + ) 
This equation is separable in the X and x variables. Thus 


QX,x)=f (X) W(x) , (5.4) 


and f (X) and (x) satisfy the following differential equa- 
tions, respectively: 


ə Oo, 

ss — me —-(A41) |fm=0, (5.5) 
ax? 

EI Aes 

2 [a7 ++ |Wa)=0 | (5.6) 


+ txt x? Hm lolx x) =0 , 


V. COVARIANT HARMONIC OSCILLATORS 


The covariant harmonic oscillator has a long history. 
It was Dirac who suggested in 1945 the use of harmonic 
oscillators to construct representations of the Lorentz 
group.'* Yukawa in 1953 studied the possibility of using 
the oscillator for studying relativistic composite parti- 
cles.'” However, its physical relevance was not revealed 
until the successful calculation of the proton form factor 
by Fujimura, Kobayashi, and Namiki in 1970.'® In the 
1971 paper of Feynman, Kislinger, and Ravndal, the au- 
thors point out the need for relativistic bound-state mod- 
els, such as the harmonic oscillator model, in order to 
supplement the traditional Feynman-diagram approach 
which is not always effective in dealing with covariant 
bound-state problems.?” 

The covariant oscillator formalism has been extensive- 
ly discussed in the literature.’ It serves as one of the 
physical representations of the Poincaré group. At the 
same time, the formalism allows us to explain the pecu- 
liarities of Feynman’s parton picture in terms of the 
bound-state quark model.” 

Let us start with the differential equation of Feynman 
et al." for a hadron of two quarks bound together by a 
harmonic-oscillator potential of unit strength 


(5.1) 


ch 


Equation (5.5) is a Klein-Gordon equation, and its solu- 
tion takes the form 


Sf (X)=exp(tiP,X*) , 
with 
—P?=—P,PH=M*=m$+(A+1), 


(5.7) 


where M and P are the mass and four-momentum of the 
hadron, respectively. The eigenvalue A is determined 
from the solution of Eq. (5.6). We are using the same no- 
tation for the operator and the eigenvalue for the hadron- 
ic four-momentum. This should not cause any confusion 
since we are dealing only with free hadronic states with a 
definite four-momentum. 

As for the four-momenta of the quarks p, and p,, we 
can combine them into the total four-momentum and 
momentum-energy separation between the quarks!” 


P=p,+Py, 9=V2p,—DPy) , (5.8) 


where P is the hadronic four-momentum conjugate to X. 
The internal momentum-energy separation q is conjugate 
to x provided that there exist wave functions which can 
be Fourier transformed. If the momentum-energy wave 
functions can be obtained from the Fourier transforma- 
tion of the space-time wave function, the differential 
equation in the q space is the same as the harmonic oscil- 
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lator equation for the x space given in Eq. (5.6). 

Since the three-dimensional harmonic oscillator is 
quite familiar to us, we are naturally led to consider the 
separation of the space and time variables in Eq. (5.6). 
However, the (x) system is not the only coordinate sys- 
tem in which the differential equation is separable. If the 
hadron moves along the z direction with velocity parame- 
ter B, the hadronic rest frame is important. In this frame, 
the coordinate variables are 


x'=x, y'=}, 
z'=(z—Bt) 1-6)!” , 
t'=(t —Bz)/(1—B")'” . 


The differential equation of Eq. (5.6) is separable also in 
these variables: 


a? 
ar’? 


(5.9) 


— 7'241 — —[(x' —2' ?] | ix) =(A4+ x) 


4 
2 
(5.10) 


The solution of this equation consists of a product of 
four one-dimensional oscillator wave functions. The x’ 
and y’ components are not affected by the boost along the 
z direction. Thus we can drop them from our considera- 
tion. As for the t' component, the excitation contributes 
a negative number to A. However, this excitation can be 
suppressed on the grounds that the time and energy vari- 
ables are c numbers. Indeed, the time-energy uncertainty 
relation is a c-number relation.'” This suppression of 
timelike excitations can be achieved by the subsidiary 
condition 


P, |x#—~—— |ygx)=0 (5.11) 


u 


Then the wave function takes the form 
Yaz, t)= [1/(72"n!)]'7H, (z expl — 4z? +r ?)]. 
(5.12) 


This normalizable wave function describes the internal 
space-time structure of the hadron moving along the z 
direction. If ß=0, then the wave function becomes 


yiliz, t)=[1/(72"n!)]! 2H, (z)exp[—4(z?+t°)] . 


(5.13) 
Thus 


Yaz t=’). 


VI. COVARIANT PHASE-SPACE 
REPRESENTATION FOR HARMONIC OSCILLATORS 


(5.14) 


One of the most outstanding problems in modern phys- 
ics is how to formulate covariantly interactions between 
two elementary particles. For instance, we still do not 
know exactly what force is responsible for keeping the 
quarks inside a hadron. It may be possible to make the 
interaction invariant by postulating that it depends on 
the distance in the coordinate system at rest with the 
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temporary center of mass of the particles, or that it de- 
pends on the two positions at the time when their relativ- 
istic distance is zero—when one is on the light cone of 
the other.”° 

In this paper, we take the light-cone approach. While 
it is not possible to solve all the problems at this time, we 
can discuss the uncertainty principle applicable to the 
space-time separation between the quarks in a harmonic 
system, using the light-cone variables. The covariant 
harmonic oscillator discussed in Sec. V serves as a 
theoretical tool for this purpose. The harmonic-oscillator 
wave function consists of a Gaussian factor and Hermite 
polynomials. Since the Gaussian factor determines the 
localization property of the wave function, let us study 
first the ground-state wave function, whose form is 
1/2 

exp[ —(z?+17)/2] . 


Oz, t)= (6.1) 


We have dropped the x and y variables which are not 
affected by the Lorentz boost along the z direction. 

This wave function can be written in the light-cone 
coordinate system.?! If the hadron moves along the z 
direction, the light-cone variables are defined to be 


u =(t +2)/V2, v=(t—2)/V2. (6.2) 
Their Fourier conjugate variables are’? 
Qu=(92—Go)/V 2, Gy =(4, +49)/V2 (6.3) 


The major advantage of using these variables is that the 
Lorentz boost of Eq. (5.9) takes a very simple form:!® 


1/2 
u= 1-B Zn 1+8 
1+8 l 1—8 i 
148 1/2 i B 1/2 (6.4) 
u 1—8 Qu» qy= 1+8 qy . 


Under this transformation, the products ug, and ug, 
remain invariant. 

In terms of the light-cone variables, the wave function 
of Eq. (6.1) can be written as 


Yz, t) = 


-|ż 


If the system is boosted, the wave function becomes 
1/2 


exp[ —(u'?+v'?)/2] 


Xuv) 
172 


exp[—(u?+v?)/2]. (6.5) 


Y(2,t)= 


-|2 


1/2 
exp 


1-B u? 
1+B" 


1 
2 


al 


(6.6) 
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This wave function undergoes a Lorentz deformation as 8 i 1/2 i 1468 
increases.!® The momentum-energy wave function is PH 9y +9, )= È exp | — È ae q? 
T — 
: L | ne N dt (6.7) +28 9 (6.8) 
PEI W 7, | J Yate 8% 


The evaluation of this integral leads to 


m Sat Re Fe an ee 


1 
P&(u,q,;0,9,)= = 


2 
f (wu +x,v +y)]" Yu —x,v —yJexp[2i(g,x +9,y)]dx dy . 


For the ground state, the PSD function can now be 
defined as 


(6.9) 


After the evaluation of this integral, the PSD function becomes 


1-B 3 


1 
T 1+8" 2 


2 
1 
Pp(u,q,;v,q,)= exp |- È 


The above PSD function is defined in two independent 
phase spaces consisting of (u,q, ) and (v,g, ), respectively. 
When the hadron is at rest with 8=0, the above PSD 
function is localized in the regions 


(u?+g2)<1, (v?+g?) <l. (6.11) 


These localization regions are described in Fig. 4. When 
the hadron moves, these regions undergo elliptic defor- 
mations. 

This PSD function reproduces the distributions 
| #H(u,v) |? and | —%q,,9,)|? after the appropriate in- 
tegrals: 


| Pu,v)|2= f PS(u,vsq,.a, dguday ‚ 
Pfau, )1?= f PS u,vsq,.g, du dv . 


i a nen e er eil] 


(6.12) 


1+8 2 
1—8” 


1] (146,2, 1-6 
le) 


— 

As for the excited states, there are no timelike oscilla- 
tions in the hadronic rest frame, and the oscillations in 
the transverse direction are not affected. Therefore, the 
only factor we have to consider is the Hermite polynomi- 
al A„(z’) to be multiplied to the ground-state wave func- 
tion. The mth excited-state wave function is given in Eq. 
(5.13). In terms of the u’ and v’ variables, H,,(z’) can be 
written as?” 


H,(z')=H, ((u’ +0')/V2) 


H 


m=0 


(6.10) 


H, (u Hp (0). (6.13 


n 
m 


Thus the explicit form of the physical wave function be- 
comes 


n/2 1/2 
1 1 ee I j ; , , 
pplu, v)= 7 Br 2 m |Ta-m'u)H(v’) \expl—(u 2+v'?)]. (6.14) 
This means that we need off-diagonal PSD functions for the one-dimensional harmonic oscillator, such as 
l , 
Pam (X;p)= — S viix +y Wml ye dy , (6.15) 


to evaluate the PSD function for covariant harmonic oscillator. It is possible to evaluate this integral using the generat- 


ing function of Hermite polynomials. The result is? 


1/2 
Pam (X,p)= (nim!) 


where s is n or m, whichever is smaller. We can then go 
back to Eqs. (6.9) and (6.14) to complete the evaluation. of 
the PSD function. The localization and deformation 
properties of the PSD function for excited states are 


essentially the same as those of the ground-state oscilla- 
tor. 


$ (- IH VAx +ip)]" MV 2x —ip)]"—* 


=; klin —k)!im —k)! 


exp[ —(x?+p?)], (6.16) 


PP —— 


Let us go back to the localization problem. Unlike the 
case of light waves,” we have to deal with two phase 
spaces. If the hadron is at rest with B=O, the localization 
region can be specified by a circle in both the phase 
spaces of (u,g,) and of (v,q,). If the hadron moves, the u 
and q, distributions expand, while those of v and q, be- 
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Qu Qu 


FIG. 4. Lorentz deformations in the light-cone phase space 
consisting of two pairs of conjugate variables. The major 
(minor) axis in the uv coordinate system is conjugate to the 
minor (major) axis in the g,q, coordinate system. The Lorentz 
boost is an area-preserving canonical transformation in both 
phase spaces. For the case of localized light waves, which was 
discussed in Ref. 2, there is only one phase space. The covari- 
ant phase space given in Ref. 2 is the lower half of this figure 
consisting of v and q,. 


come contracted. 

These deformations are canonical transformations, and 
are illustrated in Fig. 4. If the hadron’s speed becomes 
close to the speed of light with B— 1, 

t=z, u =V2z š Pan 
q= W=V 2G; , 
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Introduction 


It was an unexpected pleasure to hear about the conference on the quantum 
mechanics of phase space, and I very much appreciate the pleasure to be in- 
vited thereto. I will be able to contribute very little to it that is not con- 
tained in the Physics Report article [Phys. Rep. 106, 121 (1984)] by M. Hillery, 
R.F. O’Connell, M. Scully, and myself — an article to which I have actually 
contributed, in contrast to Dr. Scully, very little. But I will admit that the 
underlying reformulation of the Schrödinger equation was started by me, in 
1932 (Phys. Rev. 40, p. 749). I was interested in the thermodynamic behavior 
of macroscopic objects which is given with high accuracy, at ordinary or high 
temperatures, by classical statistical mechanics. At low temperatures quantum 
effects can become important and this manifested itself also in the “equation 
of state” (temperature and density dependence of the pressure) of the He gas. 
It was natural, therefore, to develop a substitute for the classical expression for 
the density in phase space (to be described below) which forms the basis for the 
calculation of the thermodynamic behavior in the temperature region in which 
classical mechanics can be assumed to be valid, and which easily provides a 
good approximation in the temperature region not too far away from the valid- 
ity of classical physics. This means a probability function of the position and 
momentum variables q and p, defined in terms of the wave function w or the 
density matrix M, a probability function which is a hermitean expression of 
the wave function, hence linear in the density matrix. It is not too difficult to 
calculate and does give accurate results for the equation of state, and I hope 
also for other quantities. 

It must be admitted, of course, that the interpretation of the phase space 
density function is much less direct in the situation in which quantum effects 
play an important role than it is in the area of classical physics. The variables 
of the phase space are the 3N position and 3N momentum coordinates of the 
N-particle system to which the density function of the phase space refers. We'll 
write n for 3N. The classical phase space function’s value at a point of phase 
space is the probability that the position and momentum coordinates of the 
N particles have the values given by the coordinates of corresponding points 
in phase space. If the phase space function has to be so closely defined that 
quantum effects play a role, this interpretation is not possible because there 
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is no state in which both position and momentum coordinates have definite 
values. In fact the states of the system are not specified in terms of these 
coordinates. It follows that the interpretation of the density function of phase 
space is much less straightforward in the region in which quantum effects play 
a significant role than it is in classical theory — that is if the probabilities do not 
change significantly within distances in which the products of the p and g are 
not far from h. All this shows that the definition — and hence also the meaning 
— of the phase space functions is not as unique in the quantum region as it is 
in the region of the classical theory. The next section will therefore discuss the 
meaning and the properties of the quantum distribution function as defined in 
1932. This definition does not take care of the existence of the spin and the 
extension of the theory to the description of the spin state will be discussed 
afterwards. 


Properties of the Proposed Quantum Distribution Function 


As is apparent from the preceding discussion, the quantum distribution function 
to be discussed does not have such a simple meaning as the classical phase space 
function. It may be useful, therefore, to describe its basic properties before 
discussing its applications. 

Let us first define the distribution function P to be considered. It will be 
defined as a function of n position and n momentum variables q and p, the n 
being three times the number of particles (n = 3N). If the state of the system is 
given by a position-dependent wave function (x1, %2,--+-,Zn), the distribution 
function is 


1 x 
Play qai Pi Pa) = an | f da dunt (qı + Y1: dn + Yn) 


x plq — Y1; , An — yn ert Pint +Pnyn)/R : (1) 


This is, clearly, a nonrelativistic definition — as is fundamentally also that of the 
classical distribution function — but has, similar to that, some useful properties. 
These will remain also after the introduction of the spin variables. Before enu- 
merating its useful properties, it may be good to give the P for a density matrix 
M(q1,°°+34n344)°'*>G,)- This can be decomposed into orthogonal and normal- 
ized wave functions %1,%s2,‘'-- which appear with probabilities w 1, we,---. If 
the distribution function is assumed to be an additive function of these: 


M(q1, aca ‚An; qis’ oe Qn) = > wrbr(q; eer On) Vea, puree An); (2) 


it is natural to define the corresponding P as 


1 


P(q,p) = AP J dy M(q — y, q + yje” PIF. (2a) 


In this equation, as in many later ones, the symbols q, y and p represent n 
variables each, f dy means integration over the n variables y, and (p - y) is the 
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scalar product J` pkyk. The notation used in these equations renders several 
future equations much simpler. Similar to the meaning of dy, the dq will mean 
integration over the n variables g and dp means integration over the n variables 
p. These notations simplify several of the following equations. 

Let me now come to the properties of the distribution functions (1) and 
(2a) which I consider to be of significance. The proofs will be given for (1) but, 
because of the definition (2) of M, it will be evident that they apply also for 
the more general form (2a). 

1. P(q, p), if integrated over p (that is over p1, p2,-*-, Pn), gives the proba- 
bility of the configuration q that is the probability that the position coordinates 


are 41, Q2,°"',Qn- This is easily verified. 
2. Similarly, if P(q, p) is integrated over the q, it gives the probability that 
the momentum coordinates have the values p1, po,---,Dn- 


These two properties can be easily verified, and it is clear also that they are 
less significant than the basic property of the classical P which represents the 
probability for both the positions to be given by q1,:--, Qn, that is by q, and the 
momenta by p. But they do show that the average value of the classical energy, 
being the sum of two functions, one of the momentum the other of positions, 
can be easily obtained. 

3. These two observations suggest that q and p play similar roles in the def- 
inition of the distribution function P. Indeed, if the ~(q) is expressed in terms 
of its Fourier transform, the wave function x(p) of the momentum coordinates: 


Ya) = J x(p)ei? Vp, (3) 


where we neglect constant factors temporarily, we obtain for P(q, p) 


P(q,p) = 1 J J x (pe P O+) hy(p" jei" (I-W)/F e2tp-y/h dn! dn"dy. (3a) 


The factors involving y give a delta function 6(2p — p’ — p”) so that, again disre- 
garding a constant factor, we can set p = p + z, p” = p — z and the integration 
over z will replace the integration over p’ and p”. Hence (3a) becomes 


P(q,p) = (rh)”* / dzx* (p + z)x(p - 2)e /R. (3b) 


The numerical constant before the integral sign follows from the fact that the 
integral of P remains 1 and that the x are also normalized. Eq. (3b) is a close 
analogue of (1), except for the fact that i is replaced by —i — which is natural — 
and shows that position and momentum coordinates play essentially the same 
role in the definition of our distribution function — just as they do in classical 
theory. 

4. The transformation properties of P are the classical ones with respect 
to any of the classical transformations. The substitution of q + a for q clearly 
gives P(q + a,p) from P(q,p) - and this remains true even if a is not the 
same vector for all particles. If % is replaced — we use (1) in this discussion 
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— by e’*/R,) the distribution function so obtained assumes the values of the 
original distribution function if p + « is substituted for p — actually « can be 
an arbitrary n dimensional vector, but naturally independent of the g. 

The past three points are natural demands and are easily verified. 

5. The so-called transition probability between two states, ¢ and w for 
instance, is , as a rule, not really observable. If the system is in the state w and 
an observation is made as a result of which the system’s state vector becomes 
$, the probability of this result of the observation is (if both ¢ and w are 
normalized) |(&, %|?, the absolute square of the scalar product of the two state 
vectors. The observation in question is not possible for every state vector ¢, but 
the existence of the scalar product, or at least the measurability of its absolute 
value, is often assumed for all ¢. If the original state of the system is best given 
by a density matrix M, the probability that the measurement transfers it into 
the state & is given by the scalar product of ¢ and Mọ, that is (¢, Mg). It is 
worth noting therefore that if Py and P, correspond, by (1), to ~ and ¢, the 
so-called transition probability between them becomes 


Ip, o)l? = (2rh)" J J dpdq Py(a,p)Ps(a;P). (4) 


It follows from (4) also that if Pu and Py are the distribution functions which 
correspond to the density matrices M and N, then 


Trace (MN) = (2rh)" JJ dpdq Pm (q, p)Pn (q, p). (4a) 


All the preceding observations are easily verified and are also contained in the 
aforementioned article of Hillery, O’Connell, Scully and myself — most are in 
fact also contained in the aforementioned 1932 article. Apparently, there is a 
great deal of arbitrariness in the definition (1) of the distribution function but 
R.F. O’Connell has shown that some of the preceding properties already fully 
determine it. This was not known when (1) was originally proposed but is well 
worth remembering. 

The last observation, Eq. (4), also shows that most distribution functions, 
though real, are not everywhere positive. For two orthogonal wave functions, 
(p, p) = 0, the integral over P,P, must vanish. They can not be both positive 
everywhere — for most w and ¢ neither is. But, as Heisenberg pointed out, there 
is no state for which both p and q have definite values. Transition probabilities 
are observable, at least many of them, and it is satisfactory that, according to 
(4), the expressions for these cannot be negative. The fact that most functions 
of p and q do not represent possible states renders the quantum distribution 
function to be a less simple quantity than is the classical distribution function, 
since, in classical theory, all everywhere non-negative distribution functions are 
conceivable. This point will be mentioned again later, together with the fact 
that the condition which an arbitrary function of p and q must obey in order to 
be a possible distribution function is not simple. Clearly, it must be possible to 
write it in the form (2a) with a positive definite (or non-negative) self-adjoint 
matrix M, but this is not a simple condition. 
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6. The preceding observations on the properties of our quantum mechanical 
distribution functions gave properties which the classical distribution functions 
also had - in fact the properties of the latter were more general. We now come 
to an equation which shows the quantum mechanical nature of our distribution 
function — the equation of its time dependence. Essentially the same equation 
will be used afterwards to determine the distribution function for the thermo- 
dynamic equilibrium. 

The equation for 0P/Ot has two types of terms. The first type originates 
from the kinetic energy terms —(h?/2m)6/0q? of the expresssion for shörb/öt, 
the second one from the potential energy terms. Both are easily determined 
and were long ago. Here only the first one will be reproduced in full detail. It 
gives for (rh)"(OP/0t), 


th? a 


a a Mae e= e | tool ay. (5) 


The second derivatives with respect to q can be replaced by second derivatives 
with respect to y and a partial integration then be carried out. The two terms 
in which the products of both first derivatives appear then cancel and the terms 
in which the exponential is differentiated gives 


The differentiations with respect to y can be replaced by differentiations with 
respect to q — changing the sign of the second term. The result then is the 


same expression which appears in classical theory for the kinetic energy part 
(ƏP/ðt)k of OP/Ot - if written in detail it is 


(3 hae : os a 


Ot A mer Ik 


The potential part of 0P/Ot can be expressed in two ways. One can expand 
the potential energy expression in 


(mh)" Fe = feu + y)[-V(qt+ y) + Vlq - y)lyla — y)e*P4/"dy (6) 


either into a power series of y, or represent it as a Fourier transform. The 
second possibility shows again that p and q play similar roles in the theory of 


distributions. But the expansion of V (q — y) — V (q + y) as a power series of y 
gives 


1H tàn 
-V(aty)+V(g-y)= > 2 at o) 


Ai: 
y Ay)? -yàn 
( eee ). Oe) 
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the summation to be extended to all non-negative (integer) A the sum of which 
is odd. This gives for the potential caused part of the time derivative 


(=) B D | 
at), ox Aa! 


8At +An AA+ +Àn 
\ 4 x we? (6b) 
ðq; lons Ogn” Ope’) eee Opp” 
in which, however, all the À are non-negative and their sum odd. The first term 


of the series, in which one À is 1, all others 0, gives the classical expression 
for (3P/ðt)p. The lowest order corrections contain the second power of fi. And 


they constitute for 
ðt \ ae Je t), 


the lowest order corrections. None of the preceding considerations is new, nei- 
ther is the last point of this section. 

7. The oldest use of the quantum mechanical distribution function was 
based on the calculation of the quantum effects on the equations of states of 
gases. If Bose or Fermi statistics of these is disregarded, (this was treated later, 
in 1984 by O’Connell and Wigner) the distribution function of these is the 
normalized form of e~#/*?. Setting 1/kT = ß, this can be written as e~?” 
and the equation which replaces the equation for 0P/Ot becomes 


— = —HP. (7) 


The expansion of P in terms of @ has been discussed when the expression (1) or 
(2a) was first proposed and is reviewed also in the article by Hillery, O’Conell, 
Scully and Wigner mentioned several times before. There is no point repeating 
the calculation which replaces the calculation of the distribution function for 
e!H/iR by the calculation of the distribution function for e~°”. Perhaps I men- 
tion that the first application of the quantum mechanical distribution function 
concerned the equation of state of the He gas. At very low temperatures the 
experimental results deviated considerably from that given by classical the- 
ory, that is by the classical distribution function. The correction introduced 
by the quantum corrections to this discussed here were in the right direction 
but accounted only for about 2/3 of the deviations from the experimental mea- 
surements. It is possible that the reason for this was that the potential energy 
function was not known well enough. It would therefore be worthwhile to repeat 
that calculation. Its desirability was actually the stimulant for the introduction 
of our P. 
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The Spin Variable 


The preceding discussion largely disregards the spin variable — which is natural 
in the case of the He gas, since the He atoms have no spin. A possible way to 
add the description of the spin state to that of the other variables was discussed 
before (1983) for systems with spin 1/2 but that is easily generalized for larger 
spin. 

For every particule of spin s the density matrix has (2s + 1)? components. 
The problem is only to find such linear combinations of these components which 
have relatively simple properties. We can specify the (2s +1)? components with 
two index symbols: u and p’ — the first giving the row index of the density ma- 
trix, and p’ the column index. Both run for each particle of spin s from —s to 
s in integer steps. We can then form, for each particle, another description of 
the spin state by combining the row and column components to have simple 
transformation properties. They will have transformation properties which cor- 
respond to the direct product of two representations D(®). It is possible then 
to produce linear combinations of the components characterized by u and u’ 
which transform under rotations by the representations DO), D@),..., D(s). 
In the case of s = 1/2, which was considered before, there is a scalar and a 
vector component ~ the former giving the total probability, the others being 
formed by the components of DV. 

Let us denote the density matrix by M (£, u; €’, u’), u giving the row index 
of the spin variable of the particle in question, € denoting all other variables 
of the row, u’ and €’ the same interpretation for the columns. The distribution 
function proposed would replace the u and u’ by the indices S and m: 


M'(E, g ) S, m) == > (S, m; 5, H, 3 w)M(&, H; E’, w) I (8) 


pp! 


the first factor after the summation sign being the coefficient which transforms 
the representation of the direct product D©)” x D() into D(S), and m, u and 
u’ are the row indices of the representations S, and DW)“ and D(s). It would 
not be reasonable to produce here these coefficients in general but it may be 
worth noting that | 


M'(6,6:0,0) = =—— JD M(E, m6, u): (8a) 
u 


For the case of s = 1/2, the S assumes only two values: 0 and 1. The coefficients 
for 0 are given in (8a), those for S = 1 the transferred M, that is the M’, 
were given as the expectation values of the x,y, and z components of the 
spin operator, that is of Sz, Sy, Sz. In many cases the effects of the higher $ 
components of M’ are insignificant and in those cases the same transformation 
of the spin coordinates can be recommended. 

The total transformation to the quantum mechanical distribution function 
P obeys then the same equation as in the absence of spin (4) and the u and p’ 
for every particle are replaced, in terms of (8), by S and m. 
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This is a somewhat superficial description of the transformation of the spin 
variables for what I call the quantum mechanical distribution function, but I 
hope that it gives the proposed transformation clearly enough. 


Problems of the 
Proposed Quantum Mechanical Distribution Function 


The quantum mechanical distribution theory here described has two weak- 
nesses. One of these was mentioned before: given an arbitrary real function 
P(p,q), it is not clear whether it is a possible distribution function. If it is, it 
can be written in the form (2a) in terms of an acceptable density matrix M but 
the acceptability of a density matrix is also not easily verified. In particular, it 
must be positive definite, or semidefinite — that is no expectation value of the 
transition to any state, that is no (¢, M¢) can be negative. This applies also 
to our P: no integral of the product of two quantum mechanical distribution 
functions can be negative. 

Just as in the usual theory, it is sufficient to demonstrate that the product 
is non-negative with any distribution function representing a single state, that 
is having the form (1), but even this is an infinite task — just as it is in ordinary 
quantum mechanics dealing with density matrices. 

The other difficulty well worth mentioning is one also shared, at least to 
some degree, with the usual formulation of quantum mechanics: the postulate 
of the coherence with relativity theory. This causes difficulties also in the usual 
theory — it is necessary to introduce a field, that is an infinitely more complex 
definition of the state than is used in Schrödinger’s old fashioned theory. In 
addition, the equations often lead to infinities and these must be eliminated 
by “renormalization”. In summary, even the usual theory has weaknesses — I 
would say that its beauty is not absolute. 

But the weakness of the theory here discussed is much more fundamental 
— at least it is so at present. It assumes that the interaction of the particles 
is instantaneous — that it depends only on their same-time positions. This is 
acceptable, and in fact generally accepted, in non-relativistic theory but is in 
conflict with the theory of relativity in which simultaneity is not independent 
of the state of motion of the coordinate system describing it. This renders, 
quite generally, the description of the states of systems by phase space func- 
tions unattractive — in phase space the interaction is assumed to depend on 
the simultaneous position of the particles and is, therefore, not relativistically 
invariant. It is possible to make it invariant, for instance by postulating that it 
depends on the distance in the coordinate system at rest with the temporary 
center of mass of the particles, or to depend on the two positions at the time 
when their relativistic distance is zero — when one is on the light cone of the 
other or conversely. It is even possible to assume a “force” depending on the 
integral of the distances between the light cones. But these possibilities have 
not been explored to my knowledge and the present theories assume interac- 
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tions of fields - i.e. only interactions at points of the same positions and times. 
These gave many apparently correct results but needed the introduction of 
“fields”, in particular electromagnetic potentials, and are not in harmony with 
the phase-space theories. Perhaps this could be amended, but I do not know of 
serious attempts in that direction — not even by myself. 
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The Lorentz deformation property of the phase-space distribution function is studied for harmon- 
ic oscillators. The overlap effect of two distribution functions is discussed in detail. It is shown that 
the Lorentz deformation of the phase-space distribution is responsible for the polynomial cutoff be- 
havior of the proton form factor in the harmonic-oscillator quark model. 


L INTRODUCTION 


The phase-space representation of quantum mechan- 
ics! is of current interest.” ”* We have shown in our pre- 
vious papers’'® that the light-cone coordinate system is 
the natural language for the Lorentz-covariant phase 
space representation of quantum mechanics. The 
Lorentz-deformed phase-space distribution was discussed 
in detail for localized light waves’ and harmonic oscilla- 
tors.° It was shown in Ref. 6 that the covariant harmonic 
oscillator can illustrate the Lorentz-deformed phase- 
space distribution for a relativistic free particle with a 
space-time extension. 

The purpose of the present paper is to study the over- 
lapping phase-space distributions functions. We are of 
course interested in Lorentz-deformed distributions, and 
in how their effect manifests itself in the real world. For 
this purpose, we shall study the electromagnetic structure 
of nucleons. Because the radius of the proton is 10° times 
smaller than that of the hydrogen atom, the proton had 
long been thought to be a point particle. However, 
Hofstadter’s discovery in 1955 clearly demonstrated that 
the proton has a spread-out charge distribution.’ 

Although there had been many attempts to understand 
the structure of nucleon since 1955, the first comprehen- 
sive approach to the probability distribution of hadronic 
matter was the quark model, in which the nucleon is a 
bound state of three quarks. Among the mathematical 
models for this bound state, the harmonic-oscillator mod- 
el gives a simple explanation for a wide range of hadronic 
phenomena observed in high-energy physics labora- 
tories.°~'' While the major strength of the oscillator 
model is its mathematical simplicity, its most useful 
property for our present purpose is that the oscillator 
model constitutes a representation of the Poincaré 
group.'2~'!© The harmonic-oscillator model can be made 
covariant. 

Another important property of the oscillator model is 
that it is the natural language for phase space.”®!7 
Therefore the harmonic-oscillator model is an effective 
scientific language for the covariant description of phase 
space. We shall study in this paper overlapping distribu- 
tion functions in the covariant phase-space formulation 
of harmonic oscillators. 


39 


In Sec. II we summarize the earlier works on overlap- 
ping phase-space distribution functions and their physical 
interpretations. Section III explains how the form factor 
is defined and why a Lorentz-covariant formalism is 
needed for studying the form factor. Section IV deals 
with the covariant phase-space distribution functions 
which are needed for calculating the form factors. In 
Sec. V we study how the overlap of two Lorentz- 
deformed phase-space distribution functions lead to the 
correct form-factor behavior in the harmonic-oscillator 
model for hadrons. 


1I. OVERLAPPING PHASE-SPACE 
DISTRIBUTION FUNCTIONS 


If (x) is a solution of the time-independent 
Schrödinger equation, its phase-space distribution func- 
tion is 


Wolxipl=— fox +y)blx —y)e”Pdy . (1) 


For simplicity, we shall use the term “PSD function” for 
phase-space distribution function. This is a function of x 
and p which are c numbers. This function is real but is 
not necessarily positive everywhere in the two- 
dimensional phase space of x and p. We can, however, 
recover the positive distribution functions in the position 
and momentum coordinates by integrating the PSD func- 
tion over p and x, respectively,» 


p(x)= | W,(x,plap, o(p)= | W,(x,pidx a (2) 


In this paper, we are interested in two overlapping 
PSD functions. Indeed, the overlap integral becomes the 
absolute square of the inner product of the two wave 
functions in the Schrödinger picture. If W,(x,p) and 
W,(x,p) are ar PSD functions for Y(x) and ¢(x), respec- 
tively, then! 


[wur pW ax, p)dx dp =(1/2m (60), Wa?) 


This expression is non-negative, but can be zero if the 
two functions are orthogonal, indicating that the PSD 
functions are not always positive everywhere in phase 
space. 
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In studying the interaction of a photon with an atomic 
system, we often encounter the matrix element of the 
form 


M; =p ey). (4) 


This is the i inner product of the wave functions ¢ and y’ 


with #'=e!y. Thus we have to construct the PSD func- 
tion for y’, 

Wuinp)= forex +y)blx —y)e? -Ydy (5) 
This leads to 

Wy (x,p)=W,(x,(p —k)) . (6) 
Therefore 
I(d,e"yl?=2r f Wylx,p)Wylx,(p —k))dx dp. D 


We have thus far carried out the formalism for one- 
dimensional space. The generalization to the three- 
dimensional space is straightforward and has been dis- 
cussed in the literature.!:? 


Il. FORM FACTORS 


If electrons are scattered by a charged point particle, 
the scattering amplitude in the Born approximation is 
f (@)=2me? Ak, —k; )?, On the other hand, if the elec- 
tron is Saded by a spread-out charge due to quantum 
probability distribution, the scattering amplitude is 


f(0)=(2me?/K?)F(K?). (8) 


where K=k,—k,, and K’=(k,—k,)?. F(K?) is called 
the form factor and takes the form 


F(K?)=(bpe EW = f ipa] yae Ed? , 
(9) 


with F(0)=1, if the initia]- and final-state wave functions 
are the same. If Ey] %,(x)} describes a point-charge 
distribution with (x), F(K?)=1 for all values of K?. 
According to Eq. (3), the form factor should take the 
form 

|F(K?)P?= f W,(x,p)W;(x,(p—K))d?x d3p . (10) 
This is a generalization of Eq. (7) to the three- 
dimensional space. 

As the energy of incoming electrons becomes higher 
for the fixed nucleon target, K? becomes very large, and 
the problem becomes relativistic. For the electromagnet- 
ic interaction of point particles, we have to use quantum 
electrodynamics, where the scattering amplitude is ex- 
panded in a power series of the fine-structure constant 
a=e?/4r in the Lorentz-Heaviside unit. The lowest 
nontrivial term in this expansion is essentially a relativis- 
tic version of the Born approximation. 

In lowest order in a, we can describe the scattering of 
an electron by a proton using the diagram given in Fig. 1. 
The corresponding matrix element is given in many text- 
books on elementary particle physics.'® It is proportional 
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proton 


FIG. 1. Elastic electron-proton scattering. The electron 
behaves like a point charge. However, the proton has its ha- 
dronic structure, and has a spread-out charge distribution. 
to 

e?[ UP, IF «PP; )U(P,)1/K?)[ Ulk, IyFU(k,)], 
(11) 


where P;, P,, k;, and k, are the initial and final four- 
momenta of the proton and electron, respectively. U(P,) 
is the Dirac spinor for the initial proton. We use here the 


four-vector convention x“=(x,y,z,t). K? is the four- 
momentum transfer squared given by 
K?=K?—Kj=(P,—P,)=(kj—k,)? . (12) 


The 1/K? factor in Eq. (11) comes from the virtual pho- 
ton being exchanged between the electron and the pro- 
ton. In the metric we use, the quantity is positive for 
physical values of the four-momenta for the particles in- 
volved in the scattering process. 

The function T(P,,P,;) in Eq. (11) represents the 
closed circle in Fig. 1 and carries the effect of the nucleon 
structure. If the proton were a point charge, we would 
have [,=y,. If the proton has an extended charge 
erasure: we will be inclined to write it as r= =y,F (K 2); 
However, the proton and neutron have anomalous mag- 
netic moments whose values are 2.79 and — 1.91 in units 
of e/2M for the proton and neutron, respectively, where 
M is the nucleon mass. If we include these observed 
anomalous magnetic moments, [F should be written as 


D =Y )F (K?)+i(0,.K"/2M )F{(K?) . (13) 


The form factors are scalar functions in the Lorentz- 
invariant variable K?. When we compare F,(K?) and 
F(K?) with experimental data, it is more convenient to 


use the following linear combinations: 
Gy(K7)=F,(K?)+F,(K?) , 3 
1 
G_(K?)=F,(K?)+(K*/4M?)F,(K?) . 


These form factors should be written for the proton and 
the neutron separately. We may use the superscripts p 
and n to distinguish them. When K?=0, 
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G£,(0)=", =2.79, GE(0)=1 (proton) ; 


Gü(0)=u, =—1.91, GE(0)=0 (neutron) . 
These numbers are the magnetic moments and electric 
charges of the proton and neutron, respectively. 

Among the many attempts to understand the form fac- 
tors, the quark model appears to be the most promising 
approach.®!0!9 In this model, the nucleon consists of 
three nonstrange quarks. There are two nonstrange 
quarks called u (up) and d (down) which have electric 
charges 2 and —4 respectively. The proton consists of 
two u quarks and one d quark, and the neutron is made 
up of one u quark and two d quarks. The neutron charge 
is therefore zero, while the proton charge is 1. 

Indeed, one of the early successes of the quark model 
was the calculation of the magnetic moment ratio 
Yn/u,=—3.”° However, it is even more challenging to 
calculate the form factors for increasing values of K?.'%'? 
At present, we can make the following experimental ob- 
servation. For the four form factors in the nucleonic sys- 
tem given in Eq. (14), the neutron charge form factor is 
zero at K?=0, and remains small (not zero) for all values 
of K?.?! The three remaining form factors decrease like 
1/(K?)? as K? increases beyond the value of the nucleon 
mass squared.”” This behavior is usually called the dipole 
fit. We are interested in the question of whether each of 
the form factors can be written in terms of the single 
form factor G(K?), multiplied by a constant, where 
G(K?) is normalized as G(0)=1, and is proportional to 


(15) 


Point Charge 


Lorentz - deformed 
Phase Space Distributions: 


G (Kĉ) 


Reol World 


Goussian 
without Deformation 


IO 20 30 
K? [(Gev/c)?] 


FIG. 2. Form-factor behaviors for increasing values of K?. If 
the proton is a point charge, the form factor should be indepen- 
dent of K?, as is illustrated by the horizontal line. If the charge 
distribution is Gaussian, the nonrelativistic calculation leads to 
an exponential cutoff in X?. The relativistic calculation gives a 
reasonably accurate description of the real world. At present, 
the experimental data are available for G} (K?), GE(K?), and 
GĻ(K?) from K?=0 to 25, 15, and 7 (GeV/c)’, respectively. 
They are all consistent with the relativistic calculation with 
Lorentz deformation. i 
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1/(K?)? for large values K?, as is illustrated in Fig. 2. In 
the case of the neutron charge form factor, we have to 
multiply G(K?) by zero within the framework of the 
model in which the spin, unitary spin, and spatial wave 
functions are factorized.?! The question is whether it is 
possible to calculate the above-mentioned dipole behavior 
of G(K?) using the wave functions obtained from the 
quark model. 

Another important aspect of the quark model is that 
the forces between the quarks are like harmonic oscilla- 
tors, and the hadronic mass spectrum is consistent with 
the equal mass-squared spacing predicted by the oscilla- 
tor model.”'® In addition, the parton distribution shows 


`a Gaussian shape in the region where the structure func- 


tion can be measured accurately.!! 

It is therefore a reasonable approach to calculate the 
nucleon form factor assuming that the nucleon is in the 
ground state of a harmonic-oscillator system. However, 
the Gaussian distribution gives an exponential cutoff of 
the type expl —K?/40.), where Q is the spring constant 
of the oscillator system. This contradicts the experimen- 
tal observation, as is indicated in Fig. 2. It is therefore 
interesting to see whether the effect of Lorentz deforma- 
tion in phase space could transform this exponential de- 
crease into a polynomial cutoff. 


IV, COVARIANT PHASE-SPACE 
DISTRIBUTION FUNCTIONS 


The covariant phase-space representation for harmonic 
oscillators has been discussed in Ref. 6. As in Ref. 6, we 
start with two quarks bound together by a harmonic- 
oscillator potential. Then the convenient coordinate vari- 
ables are!’ 


X =(x,+x,)/2, x =(x,—-x,)/2V2. (16) 


With these variables, the hadronic wave function takes 
the form 


Q(X, x)=et Ay), (17) 
where P is the hadronic four-momentum. The wave 
function Y(x) describes the internal motion of the two- 
quark system. The preceding form as a representation 
space of the Poincaré group’? for relativistic extended 
hadrons has been discussed in the literature.'® 

As for the four-momenta of the quarks p, and p,, we 
can combine them into the total four-momentum and 
momentum-energy separation between the quarks,’ 


P=p,+p,, g=V2Up,—Ds) » (18) 


where P is the hadronic four-momentum conjugate to X. 
The internal momentum-energy separation g is conjugate 
to x provided that there exist wave functions which can 
be Fourier transformed. 

If the hadron moves along the z direction with velocity 
parameter 8, the hadronic rest frame is important. In 
this frame, the coordinate variables are 

x'=x, y'=y, 


z'=(z —Bt)/(1—B*)'” , (19) 


t=(t —pz)/1 -B® . 
Since the x and y variables are not affected by boosts 
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along the z direction, and since the harmonic-oscillator 
system is separable in the Cartesian coordinate system, 
we can drop these variables from the wave function. It is 
important to note that t and rt’ in Eq. (19) are the time- 
separation variables between the quarks. It was shown 
that the ground-state harmonic-oscillator function for the 
moving hadron takes the form 
1/2 

expl —Q(z"+1'7)/2] . 


Pp(z,t) = (20) 


1 
T 
Q is the spring constant for the oscillator system. For 
simplicity, we can use the unit where Q=1, and restore 
this factor when we are ready to compare our calculation 
with experimental data. 

This wave function can be written in the light-cone 
coordinate system, where the coordinate variables are 


u =(t +z)/ V2, v=(r-z)/V2, (21) 


and 


9,=19,-g0)/V2, g, =(q; +40)/ V2 . (22) 


In this coordinate system, the Lorentz boost of Eq. (19) 
takes a form 


N B 1/2 1+6 1/2 
u = 1+8 ‚u = 1-8 U, 

EDEL Be (23) 
qy = 1-B gu 4 7 1+8 qy : 


The wave function of Eq. (20) then becomes 


1/2 
Welz, t)= 4 exp[ -(u"?+v'?)/2] 
mag 1{1-B ,, 1+B 
= |— -+ J2 P ,24 427PR 2 
l ex 5 148" |. 


(24) 


The Lorentz-deformation property of this wave function 
has been discussed in the literature. '© 

For the ground state, the PSD function can now be 
defined as® 


2 
1 
W plu sry) = B STysu +x,v +y)]* 


X oglu —x,v —y) 
Xexp[2i(g,x +g,y)] 


Xdx dy . (25) 


After the evaluation of this integral, the PSD function be- 
comes 
W gl us Qu3sdy =I glu, g, J _glv,g,) , (26) 


with 
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1-8 2, 1#B : 
148“ 1-8“ 


1 
Jglu,q,)= Texp |- 


The PSD function W,(u,g,3v,g,) can be separated into 
two phase spaces consisting of (u,g, ) and (v,q, ), respec- 
tively. When the hadron is at rest with 3=0, this PSD 
function is localized in the regions 


(u2+q2)<1, (v?+q2)<1. (27) 


When the hadron moves, these regions undergo elliptic 
deformations.® 

It is straightforward to generalized the preceding cal- 
culation to the three-quark system in the harmonic- 
oscillator regime. !>!6 If we let x,, x,, and x, be the 
Space-time coordinates of the quarks, it is more con- 
venient to use the variables 


X=Hx,+x,+x,), 


(28) 
r=1(x,+x,—2x,), s=4X,%,), 
and their conjugate variables, 
P=ptptP. » 
Sn (29) 


g=p,+p-2p., K=V3(py—pg) « 


In terms of these variables, the covariant harmonic- 
oscillator wave function for the three-particle bound sys- 
tem takes the form 


¥p(r,s)=(1/mexp[ —(0/2)(r'2+r'3+s'3+5'5)], (30) 


where, as in Eq. (20), the transverse components have 
been ignored. The primed coordinate variables are those 
in the hadronic rest frame. This function can also be 
written in terms of the light-cone coordinate variables, 
just as in the case of Eq. (24). It is then straightforward 
to construct the covariant phase-space distribution func- 
tion. 


V. CALCULATION OF THE FORM FACTOR 


Let us now go back to the expression for the form fac- 
tor in Eq. (10). Since the harmonic-oscillator model gives 
a reasonable description for the mass spectrum of non- 
strange baryons,” and since the shape of the proton struc- 
ture function shows a Gaussian behavior where the ex- 
perimental data are accurate,!! we are compelled to cal- 
culate the nucleon form factors with the ground-state 
harmonic-oscillator wave function. 

For a two-body bound state, the nonrelativistic calcula- 
tion without the Lorentz-deformation effect gives the 
form factor of the form 


g(K?)=exp(—K?/4Q) . (31) 


We use g(K?), instead of G(K?), for the two-body bound 
state. This expression does not lead to a polynomial 
cutoff for large values of K’, and therefore is not con- 
sistent with the real world as is described in Fig. 2. The 
story is the same for G(K7) for the three-body bound 
state. 

We are interested in the question of whether the 
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FIG. 3. Breit frame for electron-nucleon scattering. The 
momentum of the outgoing nucleon is equal in magnitude but 
opposite in direction to that of the incoming nucleon. 


Lorentz effect on the Gaussian distribution will lead to a 
dipole fit. For this purpose, let us go to the Lorentz 
frame in which the momenta of the incoming and outgo- 
ing nucleons have an equal magnitude but opposite signs, 
as is described in Fig. 3. Then 


P,+P,=0. (32) 


The Lorentz frame in which this condition holds is usual- 
ly called the Breit frame.” We assume that the proton 
comes in along the z direction and goes out along the neg- 
ative z direction after the scattering process. In this 
frame, the four-vector K=(k,—k;)=(P;—P,) has no 
timelike component. Thus the exponential factor 
exp(—iK-x) can be replaced by the Lorentz-invariant 
form exp(—iK-x). 

We can use the covariant harmonic-oscillator wave 
function discussed in Sec. IV for the proton. If we as- 
sume for simplicity that the proton is a bound state of 
two quarks, and the form factor should take the form 


B(K2)= f ypx Kase . (33) 
Then the only difference between this form and the non- 
relativistic cases is that the integral of Eq. (33) requires 
an integration over the timelike variable. This time- 
separation variable has been thoroughly discussed in the 
literature.%'%16 If B is the velocity parameter for the in- 
coming proton, Y; and w, in Eq. (33) should be replaced 
by Yg and d_, of Eq. (24), respectively. The form-factor 
integral in the Breit frame takes the form 


g(K2)= fyt p(z,thbglz,ne az dt . (34) 
In terms of the light-cone variables, 
g(K?)= f Pt u,v bglu,vle Kt dudo, (35) 


where K is the magnitude of the vector K. The form fac- 
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Qu 


Lorentz- deformed 
Distributions 
8=0.8 


Without 
Deformation 


FIG. 4. Lorentz-deformed phase-space distribution functions 
and their overlaps. According to Eq. (27), the J, function of 
Eq. (26) is localized within a circular or elliptic region. As the 
momentum transfer increases, the PSD functions become 
separated. Without Lorentz deformation, the PSD functions 
become completely separated in the overlap integral of Eq. (35). 
This lack of overlap is the cause of an unacceptable exponential 
cutoff in K?. However, the Lorentz-deformed PSD functions 
maintain a small overlapping region as K? increases. This leads 
to a polynomial decrease of the form factor. 


tor can then be computed from the overlap integral of 
two Lorentz-deformed PSD functions, 


lg (K?)|? 
=(27}? f W _p(usqys0»Q,) 
X Walu,(g, —K V2;0,(9, ~K /V2)) 


Xdu dq,dv dq, . (36) 


Then, in terms of the Jg function defined in Eq. (26), 
g (K?) takes the simpler form 


g(K?)=2n f J_pluyg, J plu, (q,-K/V2))du dq, . 


(37) 


This overlap integral of the PSD function is illustrated in 
Fig. 4. The evaluation of this integral is straightforward, 
and the result is 


g(K*)=[(2M?/(2M?+K?)] 


Xexp{ —M?K?2/[20(2M?+K?)}} . (38) 


This expression becomes the nonrelativistic form of Eq. 
(31) for small values of K? and becomes 1 for K?=0. It 
decreases like 1/K* as K? becomes large, but does not de- 
crease like 1/(K?)?. How are we going to get an extra 
1/K? factor? 

Since there are three quarks inside the nucleon, there 
are two oscillator modes. We can therefore expect that 
each mode will contribute a 1 /(K”) factor to give the net 
decrease of 1/(K?)? as K? becomes very large. The gen- 
eralization of the form factor calculation to this three- 
quark system is straightforward, !'!>!° for the harmonic- 
oscillator wave functions. Since the oscillator wave func- 
tions are separable, the construction of the PSD function 
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is also straightforward. The result of the calculation is 
G(K?)=[2M?/(2M°+K°)]? 
Xexp{ —M?K?/[Q(2M?+K?’)]} , (39) 


which is 1 at X?=0, and decreases as 1/(K?). The be- 
havior of this function is illustrated in Fig. 2. 

The experimental curves are nicely summarized in Ref. 
22. For protons, the data for the magnetic and electric 
form factors are available from K?=0 to 25 and 15 
(GeV/c)*, respectively. For neutrons, the data for the 
magnetic form factor is available from K?=0 to 7 
(GeV/c)*. All these data are consistent with the form 
given in Eq. (39) and illustrated in Fig. 2. 

As for the charge form factor of neutrons, the 
coefficient to be multiplied to G (K?) of Eq. (39) is zero in 
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been discussed by Hussar and Haberman in their recent 
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Throughout this paper, we ignored the effect of spins 
and assumed that the nucleon form factor can be decom- 
posed into the form of Eq. (13) in the quark model. 
Indeed, there are models in which the quark spins can be 
combined for the nucleon to give the form of Eq. (13).24?° 
On the other hand, we still do not know how to treat the 
spins in the covariant phase-space formalism. This is a 
challenging future problem. 
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PART II 


Solid State Physics 


Wigner on Solid State Physics 


Annotation by Walter Kohn 


I. Introduction 


Although the published work of Eugene Wigner on the theory of solids essen- 
tially extends over only 6 years, from 1933 to 1938 (not including three overview 
papers with F.Seitz between 1953 and 1956), and consists of only 12 papers 
out of a total of over 500, Wigner is justly regarded as one of the fathers of the 
modern theory of the electronic structure of metals. 

Important insights had preceded his work on the electronic structure of 
metals: The realization that the Pauli exclusion principle is essential for an un- 
derstanding of metals (Pauli and Sommerfeld); the origin of the strong magnetic 
coupling in ferromagnets (Heisenberg); the importance of energy band structure 
(Bloch). What Wigner and his students brought to this subject was, primarily, 
quantitative quantum mechanical theories, needed to calculate total electronic 
energies. From these cohesive energies, structures and elastic coefficients could 
be calculated. Wigner’s interest in energies, and derivative quantities, brings to 
mind his background in chemistry, where energies play a dominant role. Much 
of the work [1-4] has a distinctly practical character. One senses that, as a for- 
mer engineer, he was after a good, at least semiquantitative answer, and was 
ready to use any tools which would lead there. The problem was formidable 
because of the combined major effects of strong ionic potentials, the Pauli ex- 
clusion principle and strong electron-electron interaction effects. There is no 
small parameter which would permit systematic perturbation expansions, so 
in many places Wigner had to rely on brilliant intuitive insights (see espe- 
cially [2]). In the course of his studies of the alkali metals, Wigner was led to 
investigate the idealized model (later known as the jellium model) in which 
electrons move in a large volume filled by a static, uniform charge background. 
This model had been previously considered by Pauli, Sommerfeld and others 
under neglect of the dynamical effects of the electron-electron interaction (the 
so-called free electron model). Wigner realized that for estimates of the co- 
hesive energies these dynamical effects, called correlation energies, could not 
be neglected. Partly together with his student F. Seitz, he succeeded, by hook 
and by crook, in successively improved calculations, to obtain this correlation 
energy Ee, as function of the electronic density ({2—4]). This work, which has 
withstood the test of time astonishingly well, may be considered the beginning 
of a later very important field of physics - many body theory, — the theory of 
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very many strongly interacting particles (atoms in classical and quantum liq- 
uids, electrons in solids, nucleons in nuclei, etc.). Wigner’s correlation energy 
(or recent slight improvements) also enters the local density approximation of 
density functional theory for atoms, molecules and solids. 

A significant by-product of Wigner’s studies of interacting electrons in the 
presence of a uniform positive charge background was his realization that, while 
at high densities the electrons are in a gas/liquid phase, at sufficiently low den- 
sities and low temperatures the phase of lowest free energy is a crystalline 
phase (now called the Wigner crystal) [4], in which the electrons localize them- 
selves near the lattice points of a closepacked lattice. After many decades of 
search such Wigner - (or closely related) crystals have been discovered for 
electrons on He-surfaces and for ions in non-neutral plasmas in strong mag- 
netic fields. Apart from its intrinsic interest, the Wigner crystal which owes its 
very existence to the Coulomb interactions between electrons, highlighted the 
importance of these interactions. 

In 1953 — 15 years after his last research paper in solid state physics — 
Wigner, together with Seitz, published a review entitled Qualitative Theory of 
Cohesion of Metals, an attempt to apply the lessons learned in their classic work 
on the alkalis, to metals in general [5]. As the word “qualitative” in the title 
suggests, this was a rather sketchy overview which did not introduce significant 
new concepts and has had little impact on the field. 

Another outgrowth of Wigner’s work on the electronic structure of the 
alkali metals was his work, with J. Bardeen, on the electronic structure of their 
surfaces. Because translational symmetry is lost in the presence of a surface, 
this problem is even more difficult than that of the bulk metals. Nevertheless 
Wigner and Bardeen correctly recognized the essential role of the exchange 
energy and were able to calculate, with semiquantitative success, the electronic 
work functions [6,7]. This constituted the first serious theoretical insight into 
the electronic structure of metal surfaces, which was of enormous technical 
interest in that age of the vacuum tube. The first significant further advance 
in this area did not occur until more than three decades later. 

An important contribution to solid state theory of a very different character 
was the elucidation of the symmetry properties of the wave functions in crystals 
(i.e. Bloch waves) by Wigner and collaborators [8]. This work, on the one 
hand, was a rather straightforward “exercise” in group theory, for someone with 
Wigner’s profound command of the subject; on the other hand it established 
the principles for all subsequent work dealing with the rich variety of different 
topologies of energy bands in crystals. 

During World War II Wigner played a leadership role in the physics and 
engineering of chain reactions (see Vol.V). A critical issue was the structural 
damage caused to reactor components (at the time called “Wigner’s disease” ) 
by neutrons and other fission products. In unpublished work Wigner estimated 
that, in a wide class of materials, an atom would be permanently displaced 
from its lattice site if an energy ~ 25eV was imparted to it. This “Wigner 
threshold” provided an important design guideline. Following the war, the field 
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of “radiation damage”, was extremely active for one or two decades, with much 
of the leadership coming from former members of Wigner’s wartime group, 
especially F. Seitz (see [9]). 

In conclusion, I would like to speak of the great influence of Wigner on the 
subsequent development of solid state theory in the US and elsewhere. In the 
US the field was initiated at approximately the same time, the early 1930’s, by 
three towering figures — J. C. Slater, J. H. Van Vleck and E. P. Wigner - in close 
and cordial contact with each other. Of these probably Slater was the broadest. 
Van Vleck’s main contribution was his brilliant quantum mechanical elucidation 
of magnetism in solids, for which — about 50 years later — he received the 
Nobel prize. In my view Wigner may have been the deepest, the first to really 
understand, physically and mathematically, the dynamics of electrons under 
the combined influence of the periodic crystal potential, the Pauli principle 
and their mutual Coulomb repulsion. 

All three had some excellent students. Three of Wigner’s students became 
highly influential, during the 1940’s and 50’s and beyond: John Bardeen, Cony- 
ers Herring and Fred Seitz. Bardeen and Herring who joined the Bell Tele- 
phone Laboratories in the middle 40’s played an essential role (together with 
W.Shockley, P.W. Anderson and others) in creating there what was, by com- 
mon consensus, regarded as the word’s leading solid state group. Bardeen, 
Shockley and the experimentalist W. Brattain developed the transistor in 1948, 
for which they received the Nobel prize. Seitz in 1940 published Modern The- 
ory of Solids, which many regard as the defining publication for this field. At 
the University of Pennsylvania, the Carnegie Institute of Technology and, for 
many years, at the University of Illinois he was the key person to establish 
three of the strongest early University groups in solid state physics. In the 50’s 
Bardeen moved from Bell Labs to the University of Illinois where, in 1957, 
he, L. N. Cooper and R. Schrieffer published the now famous BCS theory of 
superconductivity. In due course, this earned them the Nobel prize, the sec- 
ond for Bardeen. The writer of these lines (who, through a misunderstanding, 
missed becoming Wigner’s Ph.D. student) can attest from personal experience 
the inspiration which Wigner instilled in students and younger colleagues, by 
sharing with them his deep physical and mathematical insights into the nature 
of solids. 


II. Commentaries 


e (with F. Seitz), On the Constitution of Metallic Sodium, 1933 |1] 
e (with F. Seitz), On the Constitution of Metallic Sodium, 1934 [2] 
e On the Interaction of Electrons in Metals, 1934 [3] 


These three papers represent the beginning of the many-electron theory 
of metals. The problem was formidable. As Wigner knew well from his ac- 
quaintance with the quantum theory of small atoms and molecules, any se- 
rious estimates of the energy as function of the nuclear positions — which is 
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required for the calculation of lattice parameters, cohesive energies and elas- 
tic constants — had to go beyond mean field theories (Sommerfeld, Hartree, 
Hartree-Fock) and include the effects of dynamical correlations. The Rayleigh- 
Ritz variational method had resulted in extremely accurate calculations for the 
He-atom (Hylleraas) and the hydrogen molecule (James and Coolidge), but 
the effort increased very rapidly with the number of atoms involved, and the 
method was totally inapplicable to the case of an “infinite” metal. 

In the absence of a systematic quantitative procedure, the authors pro- 
ceeded largely heuristically [1]. They pointed out that in a metal each electron 
is surrounded by a neutralizing hole of total charge +e in the charge distri- 
bution of the other electrons. They calculated this hole in the Hartree-Fock 
(HF) approximation for the case of a uniform electron gas. (There is no such 
hole in the Hartree approximation). They noted, however, that in the HF ap- 
proximation the hole was due to statzstical correlations of electrons of parallel 
spin and that the important dynamical correlations due to the electron-electron 
repulsion, which affected electrons of both parallel and antiparallel spin were 
ignored. In the event they adopted the heuristic viewpoint that any electron, 
when located in a particular atomic cell, say j, excluded all other electrons 
from that cell, while the ions of the other cells were perfectly screened by the 
charges of the other electrons. This led them to an effective electronic potential 


V(r) = Vi (r), r E€ % 


where k denotes the cell, 92, is the volume surrounding ion k, and V.(r) is an 
effective potential for the valence electron in atomic sodium. They pointed out 
that the lowest energy wavefunction of the 3s-band in metallic sodium differed 
from the atomic 3s-function by the boundary condition 


n- Jy = Oonall Sk 


where S; is the boundary of cell k, and n is the normal to that boundry. Be- 
cause of the periodicity of the lattice, the Schrödinger equation needed to be 
solved in only one cell, say k = 0, and to do this the authors replaced the 
actual, nearly spherical, cell Rọ by a equivolumic sphere of radius r, (later 
known as the Wigner-Seitz sphere and radius). To allow for the exclusion prin- 
ciple they added to the energy Eo, so calculated, the mean Fermi-energy, Er, 
of a uniform electron gas of density n = (Er)-}. The total energy per ion, 
FEtot(rs) was taken to be the sum of Eo and Er and minimized with respect 
to r,.This yielded a lattice parameter, cohesive energy and compressibility of 
4.2 A, 25.6 kilocalories/mole and 1.6 x 1071! c.g.s. units compared to the ex- 
perimental values (at the time) of 4.23, 26.9 and 1 x 10~1!; the agreement was 
embarrassingly good. 

In retrospect, we can say that this first paper identifies all major issues and 
deals with them in a rough yet very judicious manner. The excellent agreement 
with experiment, which the authors themselves attributed to compensating er- 
rors, nevertheless showed that all major aspects of the problem had been rea- 
sonably treated. Paper [2] with the same title as [1], goes over the same ground 
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but more systematically. It examines at length the question of the Fermi energy, 
a relatively minor issue leading to a small correction of the free electron result 
obtained in [1]. The main contribution of this paper is the first serious attack 
on the problem of the correlation energy, the change in total energy due to elec- 
tronic correlations resulting from their mutual repulsion. The authors realized 
that this energy was due mostly to the fact that electrons of anti-parallel spin 
would be kept apart, since, even without repulsion, those with parallel spin are 
kept apart by the Pauli exclusion principle. They made the inspired Ansatz 
Eq. (22), [2] for the many-electron wave-function, notwithstanding its violation 
of strict anti-symmetry. Further inspiration was needed to deal with the func- 
tions Y, (y1, '** Yn; z1) occurring in this Ansatz. So the authors assumed that, 
in dealing with the spin-up electrons of coordinate zı, an appropriate “mean 
configuration” for the spin down electrons ym, was a close-packed lattice oc- 
cupied by pairs of the latter! With this brilliantly outrageous assumption, the 
correlation energy for a uniform electron gas was approximately calculated as a 
function of r,. When this was added to the appropriate Hartree energy and ex- 
change energy, and minimized with respect to r,, a lattice parameter of 4.75 A 
and cohesive energy of 26.9 kcal were obtained. (The lattice parameter is now 
in much poorer agreement with experiment than in [1].) 

In [3], Wigner returns once again to the crucial issue of the correlation 
energy. This time, in considering the function Yı (y1, -< Yn; z1), instead of as- 
suming that the yz, constitute a close-packed lattice of electron pairs, he makes 
the more rational Ansatz 


n 

Pily: Yni 21) = Wr(21) ( F `“ F(Ym — =) 
m=1 
Wy (z1) is a plane wave and the functions f,, expected to be small, short range 
and negative, are to describe the effect of the repulsion. (This is closely related 
to what much later became known as the Jastrow Ansatz.) This time the dis- 
tribution of the Ym is taken to be that in a Slater-determinant of plane waves. 
f, is calculated by second order perturbation theory on the assumption that it 
is small. 

Wigner who draws attention to the remaining uncertainties of the calcu- 
lation, reassures himself, and the reader, by examining the known correlation 
energy of small atoms and the low density limit of the metallic correlation 
energy. He points out that in the limit (r — oo) the kinetic energy of the elec- 
trons (arz?) becomes negligible compared to their potential energy (x rj‘) 
and that to minimize the latter the electrons will occupy the points of a close- 
packed lattice, later known as the Wigner lattice. The correlation energy in this 
limit could be simply, reliably and accurately estimated as E, = 0.292e?/r, per 
electron. 

The theoretical estimates for lattice parameter and cohesive energy result- 
ing from this paper’s correlation energy were 4.62 A and 26.1 kcals/mole. The 
agreement with experiment was similar to [2]. 
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Nearly 60 years after [3], Wigner’s calculation of the correlation energy has 
been improved by only a few percent. Current theoretical values for the lattice 
parameter and cohesive energy are 4.07 A and 25.8kcals /mole. The best ex- 
perimental values are 4.22 A and 26.1 kcals. The agreement with experiment of 
the lattice parameter is significantly better than it was in [3] (today theoretical 
lattice parameters are generally accurate); the agreement of the cohesive energy 
is similar to what it was then. (In view of the considerable error of the lattice 
parameter in, [3] the exact agreement of the cohesive energy (26.1 kcal/mole) 
must be regarded as coincidental.) 


e On the Structure of Solid Bodies, 1936 [10]. 


This is a very elementary, qualitative overview of the principal classes of solids, 
molecular, metallic, valence and ionic. It was written shortly after Wigner and 
his collaborators had clearly understood the nature of metallic cohesion. ‘The 
cohesive energies of ionic and molecular solids (like NaCl and He) had been pre- 
viously understood, and those of valence solids (like Si) had just been clarified 
by the work of Pauling and others on the chemical bond. 

In the introduction Wigner presents his view of solid state physics, in rela- 
tion to physics as a whole. According to Wigner physics research proceeds on 
two fronts. He sees one front, in the main the physics of nuclear phenomena, 
as leading to “important changes in our fundamental concepts”. The efforts 
on the other front, including the physics of solids, are “...directed toward the 
deepening and broadening of our knowledge of phenomena which, we believe, 
can be understood on the basis of existing concepts and theories. Doubtless the 
second front is of less importance. It rarely leads to fundamental discoveries in 
physics proper...” Would Wigner still maintain this view after the discovery 
of electron-pairing superconductivity, the Aharonov-Bohm effect, universality 
classes in critical phenomena, the quantum Hall effect, etc.? 


e(with J. Bardeen), Theory of the Work Function of Monovalent Metals, 1935 
[6] 


This paper, [6] and its sequel, [7], by J. Bardeen on the surface dipole bar- 
rier, are the foundation of the theory of electronic structure of metallic sur- 
faces. The modest introduction, “It is perhaps not quite superfluous to have 
an approximate treatment which merely shows how the quantity in question is 
determined- - -”, is followed by a clarification of the issues involved. In current 
terminology, the work function y can be written as py = eD — u, where eD is the 
surface dipole barrier and u the internal chemical potential. Paper [6] identifies 
the two parts and estimates the quantity u; a semiquantitative estimate of D 
is provided in [7]. 

The calculation of u takes advantage of the previous calculations by Wigner 
and Seitz [1-3] of the cohesive energies of the alkali metals. For the present 
purpose the energy of an alkali metal is regarded as a function of the number 
of ions, n;, and electrons, ne: E = E (ni, ne). The total energy, of a neutral alkali 
crystal, the subject of the earlier papers is E(n;,n;) and the workfunction is 
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the energy required to remove an electron, was taken to be y = E(n,,n;_ı) — 
E(n;,,n;) = [dE(n;,ne)/Oneln.=n,. The agreement with existing experimental 
data was, somewhat deceptively, excellent when D was neglected, but it did 
establish that for these metals D was small and that all main contributions to 
y (kinetic energy, exchange energy,...) had been properly included. 


e (with H.B. Huntington), On the Possibility of a Metallic Modification of Hy- 
drogen 1935 [11] 


For Wigner, the chemist, it was natural — after the work on the alkali metals — 
to consider hydrogen, which likewise lies in group I of the periodic table. (Note 
that this paper was published in the Journal of Chemical Physics.) At the 
same time the different character of solid hydrogen was obvious: At standard 
temperature and pressure the alkalis crystallize in Bravais lattices, one atom per 
unit cell, and are metallic, while hydrogen forms an insulating molecular crystal, 
bound by Van der Waals forces between the hydrogen molecules. Following an 
idea of J. D. Bernal, the authors explored the question if, under sufficiently high 
pressure, hydrogen would undergo a transition into a metallic phase. 

As the authors state, “The calculation itself was a rather mechanical appli- 
cation of the principles outlined elsewhere,” i.e. earlier papers by Wigner and 
Seitz (and one by J.C. Slater). But the questions raised and the, mostly tenta- 
tive, answers offered by the authors - specific to hydrogen — were, in retrospect, 
the most pertinent. What the calculation showed beyond doubt was that, at 
zero pressure, an assumed metallic phase is much less stable (by about 40 kcal) 
and many times denser than the known molecular phase. The interesting ques- 
tion whether under pressure — and, if so, at what pressure — the metallic phase 
becomes stable is raised but not answered. Unavailable, at the time, was the 
energy as a function of volume of the molecular phase, which of course, is also 
needed for this purpose. The authors do, however, quote a lower estimate of 
250 kbar. Today the metallic phase has still not been conclusively identified in 
the laboratory. But both theory and experiment suggest very strongly that it 
should be stable at pressures of about 2-4 Mbar. Finally the authors advance 
chemical reasoning to suggest the possible existence of other allotropic forms, 
specifically layer structures. Later theoretical research has confirmed that the 
energy differences between several “exotic” allotropic structures are quite small. 
Finally, there is today good reason to believe that the hydrogen on the planet 
Jupiter is indeed metallic. 


e (with L.B. Bouckaert and R.Smoluchowski), Theory of Brillouin Zones and 
Symmetry Properties of Wave Functions in Crystals 1936 [8] 


This paper presents the group theoretic implications of crystal space groups 
for the symmetry properties of energy bands and eigenfunctions, and for their 
topological properties as function of k [12]. A representation of the space group 
is labeled by k?, and by the representation of the small point group, which 
leaves k invariant. A useful concept is the “star of k”, the totality of all dis- 
tinct k-vectors in the principal Brillouin zone arising from the original k by 
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application of the elements of the full point group. Energy bands are char- 
acterized by the representations of the small groups associated with k values 
at symmetry points in the Brillouin zone. The topology of the energy bands 
near such symmetry points is elucidated. Compatibility rules are established | 
between possible representations at points of high symmetry in k-space (e.g. 
(0,0,0)) and on connected manifolds of lower symmetry (e.g. (k,0,0)). From 
this the question of “sticking together” of bands (e.g. the pz, py, pz — bands of 
a composite p-band at k = (0,0,0)) is clarified. Any physicist who has seen 
the tousled “spaghettis” (E„(k)) of typical band structure calculations (e.g. 
hybridized s-d bands) appreciates the value of the group theoretic ordering 
principles so clearly presented in this paper. 


e On the Constant A in Richardson’s Equation, 1936 [13]. 


The standard expression for the thermionic emission current from a metal was 
(and is) Richardson’s Equation 


i = AT?e WIRT, 


where W is the work function of the metal and - in simple models — A has the 
universal value of 120 amp cm”?K?. In practive the form of this expression 
is well confirmed by experiment but fitting experimental data gives values of 
A that differ from the canonical “120” by factors ranging typically from 107! 
to 10+!. The simplest extensions of Richardson’s model are reflection effects, 
replacing A by A(1 — r), obviously unable to account for high values of A; and 
temperature dependence of the work-function, W = Wo + y(T - To), resulting 
in A — Ae~7/*. In this paper Wigner estimates the temperature dependence 
of W due to thermal expansion and, while unable to make quantitative esti- 
mates obtains changes which are of the right order of magnitude to account 
for the experimental data. However, since several other important effects are 
ignored (e.g. temperature dependent spacing between surface layers) little can 
be concluded except, as the author states, “that it is purely accidental if the 
(experimental) constant A in Richardson’s equation... has the value 120 amp 
em—*k-2”, 


e Effects of the Electron Interaction on the Energy Levels of Electrons in Metals, 
1938 [4] 


Here Wigner continues his considerations of interaction effects of metallic elec- 
trons in a positive uniform charge background begun in earlier papers, espe- 
cially [3]. 

He first addresses two issues. Does a uniform interacting electron gas be- 
come ferromagnetic? He concludes that, while without exchange and correlation 
effects the answer is no, for all electron densities; when exchange is included 
the ferromagnetic state becomes stable at low densities (r, > 5.47); and when 
correlation effects are also included it appears likely that the non-magnetic 
state is again stable for all densities. 
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Secondly he addresses the question of the density of low-lying excited 
states, which determines the low-temperature electronic specific heat. In sim- 
ple Hartree/Sommerfeld theory, which ignores exchange and correlation, the 
density of states at T = 0 is energy independent in accord with the T! be- 
havior of the electronic specific heat. However, when exchange is included the 
level density tends to zero with the excitation energy, and — as had been noted 
by J. Bardeen — the agreement with the experimentally linear specific heat is 
lost. Wigner discusses how correlation effects might save the day but does not 
actually solve the problem. 

Once again [3] he is led to consider electrons at low densities, where corre- 
lation effects become large, and, in particular, the electron lattice (today called 
the Wigner lattice) which eventually becomes the stable phase. Because of his 
concern with specific heats he now considers also the vibrations of the lattice. 
But the problem of the wrong T-dependence of the specific heat remains. Later 
many-body theories showed that the electron-electron interaction in the gas 
phase (relevant to physical metals) is effectively screened at large distances, 
which restores the correct linear T-dependence of the specific heat. 


e Qualitative Theory of the Cohesion in Metals, 1953 [5] 


This paper, presented 15 years after Wigner’s last published research paper 
on metals [4], offers qualitative comparative ideas about the cohesion of var- 
ious classes of metals (alkalis, alkaline earths, transition metals,---). Metallic 
cohesion is, as much as possible, related to the atomic energy levels of the con- 
stituent element. However, Wigner did no further work in this area to firm up 
these ideas. The important work of J. Friedel and co-workers in the 1960’s on 
transition metals, using tight binding models, is in a similar spirit. 


e (with F. Seitz), The Effect of Radiation on Solids, 1956 [9| 


Radiation effects — usually damage - in solids became a practically extremely 
important issue in connection with nuclear reactors, some of whose components 
are subject to intense radiation by neutrons and other particles. This paper 
explains in simple terms how radiation can permanently displace crystal atoms 
from their equilibrium positions. It concludes by speaking of the “promise to 
provide us with useful information about the solid state in general”. In the 
writer’s view this promise has since been fulfilled on a modest scale. 
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Previous developments in the theory of metals may be 
divided clearly into two parts: that based principally upon 
the hypothesis of free electrons and dealing with conductiv- 
ity properties, and that based upon calculations of valence 
forces and dealing with the chemical properties. In the 
present article an intermediate point of view is adopted and 
the free-electron picture is employed in an investigation of 
chemical properties of metallic sodium. The assumption is 
made that in the metal the K and L shells of an atom are 
not altered from their form in the free atom. The properties 
of the wave functions of the electrons are discussed quali- 
tatively, first of all, and it is concluded that the binding en- 


I. 


HE investigations which have been carried 
out so far on the constitution of metals by 
quantum mechanics may be divided into two 
classes, the work on conductivity and related 
phenomena, carried out chiefly by Bloch, Peierls, 
Nordheim and Brillouin! are mainly based on the 
hypothesis of free electrons? and are concerned 
with the interaction between the electronic mo- 
tion and the vibrations of the lattice, which is 
responsible for the electric resistance. The works 
of the other class? are mainly concerned with the 
chemical properties and crystal structure of the 
metals and are based on calculations of valence 
forces. They encounter great mathematical difh- 
culties because the application of the usual 
methods to calculate valence forces becomes more 
and more difficult as the number of atoms in- 
creases. 
The present work intends to take an inter- 
mediate point of view by applying the free elec- 
tron picture but aiming at a calculation of chem- 


1 Cf. the comprehensive treatment by L. Brillouin, Die 
Quantenstatistik. Berlin, 1932. 

2 Cf. W. Pauli, Zeits. f. Physik 41, 81 (1927); A. Sommer- 
feld, Zeits. f. Physik 47, 1 (1928). 

2 J.C. Slater, Phys. Rev. 35, 509 (1930); E. A. Hylleraas, 
Zeits. f. Physik 63, 771 (1930); and especially H. S. Taylor, 
H. Eyring, A. Sherman, J. Chem. Phys. 1, 68 (1933). 


ergy will be positive even when the Pauli principle is taken 
account of. This is followed by a quantitative investigation 
of the energy to be associated with the lowest state. First of 
all it is shown to what extent the present picture takes ac- 
count of the interactions of electrons with both parallel and 
antiparallel spins, and to what extent remaining effects 
may be neglected. Next a Schroedinger equation is solved 
in order to determine the lowest energy level for various 
values of the lattice constant. To this a correction is made 
to account for the Pauli principle and from the result the 
lattice constant, binding energy and compressibility are 
calculated with favorable results. 


ical properties of metallic sodium such as lattice 
constant, heat of vaporization, compressibility, 
etc. The method of calculation is the same one as 
that proposed by Hund for molecules* and more 
recently applied by Lenz and Jensen‘ to ionic 
lattices, and by Lennard-Jones and H. J. Woods® 
to two dimensional metallic lattices. The elec- 
trons are assumed to move freely in a potential 
field and their interaction is supposed to be con- 
tained to a large extent in this field, much as in 
Hartree’s method of the self-consistent field 
which is actually the field adopted in the calcu- 
lations of Lenz (not in ours). The initial assump- 
tions which one makes about the statistical con- 
nections of positions of different electrons are 
necessarily rather rough in this picture and 
should be improved afterwards. 


Il. 


We assume first that the electrons in the K 
and L shells are not affected by the metallic 
bond and their wave functions the same as in the 


‘F, Hund, Zeits. f. Physik 40, 742 (1927) and applica- 
tions of this point of view to crystals, Zeits. f. Physik 74, 1 
(1932), 

5 W. Lenz, Zeits. f. Physik 77, 713 (1932); H. Jensen, 
Zeits. f. Physik 77, 722 (1932). 

€ J. E. Lennard-Jones and H. J. Woods, Proc. Roy. Soc. 
A120, 727 (1928). 
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free state. This is justified since the corresponding 
wave functions practically vanish in half the 
interatomic distance. For the valence electron, 
however, such an assumption is quite out of the 
way, since the maximum of the corresponding 
wave function is (quite necessarily, as we shall 
see) just about half way between two atoms. 
Contrary to the conditions which exist in the 
free state, however, the wave function must not 
drop to zero after the maximum but can con- 
tinue periodically through the whole crystal. It 
will therefore be much smoother than the wave 
function of the free atom, and the kinetic energy 
of the corresponding state will consequently be 
much smaller than that of the electron in the free 
atom. The potential energy, on the other hand, 
will be negatively larger in the lattice than in the 
free state because outside of the above-men- 
tioned maximum the wave will not be under the 
influence of the nucleus considered originally, but 
under that of the next nucleus of the lattice, 
which is nearer. The electron with the wave func- 
tion just described will have a larger negative 
energy than that in the free atom and we con- 
sider this to be the essence of the metallic state. 

Of course, the wave function which one obtains 
by continuing the atomic wave function period- 
ically in the lattice is not the real wave function 
of the free electron in the lattice, but the energy 
of the latter will be even smaller than that of the 
former. We shall try to find an approximation to 
the real wave function by actually solving a dif- 
ferential equation. 

It must be added that not all the free electrons 
can be in the state given above, because of the 
Pauli principle. This reduces the magnitude of 
the metallic bond, because the electrons must 
have an additional kinetic energy, which is known 
as the zero-point energy of a Fermi gas. One sees 
easily, however, that this additional energy is 
smaller than the reduction of the kinetic energy 
which was obtained by continuing the wave func- 
tion periodically through the whole lattice, so 
that there certainly remains a positive amount 
for the metallic bond. 


III. 


First, we shall calculate the energy of the free 
electron in the lowest state. We shall do this by 
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numerically solving a Schroedinger equation. It 
will not be necessary to solve it for the entire lat- 
tice, because it will have the same symmetry as 
the crystal and hence will merely repeat itself a 
great number of times. Because of this symmetry, 
the derivative of the wave function at every 
crystallographic symmetry plane will be zero 
perpendicular to this plane. This will be used as a 
boundary condition. The crystallographic sym- 
metry planes which we shall use in this way bisect 
perpendicularly the lines connecting the second 
nearest atoms. If we draw lines connecting the 
nearest atoms and consider the planes bisecting 
these perpendicularly, we have every atom sur- 
rounded by a truncated octahedron (Fig. 1). 
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The middle points of the planes of the latter 
possess such symmetry (Se) that the derivative 
of the wave function must vanish at these points 
in every direction. It will be quite a good ap- 
proximation to replace the polyhedron of Fig. 1 
by a sphere of equal volume, and to take as 
boundary conditions that the derivative of the 
wave function vanishes at the boundary of this 
sphere. 

The determination of the potential function to 
be used inside this sphere is more difficult. It 
would be quite out of the way to use Hartree’s 
method as the density of electrons in the greatest 
part of the domain is very small. This funda- 
mental difference between metallic and ionic 
lattices was already pointed out by Lenz.’ If we 
assume that two electrons are never around the 
same ion, every ion may be supposed to be sur- 
rounded by a spherical electron cloud which will 
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exactly’ cancel its potential outside of the sphere. 
Hence it seems to be the best simple assumption 
to take the potential as that of the ion inside the 
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sphere mentioned above. The knowledge of this 
potential will allow us to set down the differential 
equation for the free electron which will be solved 
with the boundary condition that the derivative 
vanish at the boundaries of a sphere. The solu- 
tion will be obviously spherically symmetric 
about each atom, which is of course not true for 
the actual wave function of the free electron but 
is a consequence of our approximations. It is not 
very far from the truth, however, since the wave 
function will actually have the highest crystallo- 
graphic symmetry (0”) which is not very far 
from spherical symmetry, for to every direction 
there are not less than 47 other equivalent 
directions. 

The justification of the assumption that two 
electrons are never around the same ion arises 
from two sources Consider first the statistical 
connections between the positions of two elec- 
trons in an ideal Fermi gas. The complete wave 
function will be a determinant 


(nd - ; : 
Yıln) Yaln) +++ palin) 


where the numbers in parentheses represent the 
Cartesian and spin coordinates of the corre- 
sponding electrons and the y are the wave func- 
tions of the different states.® In order to obtain- 
the statistical relations between the positions of 
two electrons we have to square (1) and integrate 
it over the coordinates of all electrons except 
those considered, which will be taken as 1 and 2. 
Because of the orthogonality relations of the y, 
the result will be, apart from a constant, 


De Zr. |? yal)? — DADUDA2]. 
k=l Al (2) 
This still contains the spin coordinates sı and Se 


of 1 and 2 and reads more explicitly, if the edge 
of the cubic-shaped crystal is L, 
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There are really two questions to discuss: the 
statistical connection between electrons with 
antiparallel spin (sı= —o2) and between those 
with parallel spin (0; =o2). For a pair of the first 
kind the second term of (2a) vanishes so that they 
are statistically independent. For two electrons 
with parallel spin, on the other hand, we have to 
evaluate the sum of (2a) after having omitted the 


probability will not depend on the direction. 
With these conditions, (2a) becomes 
D 2 (1 — etril vi— pi) da (3) 
Yıvavya M1 M2 
Here the summation over re, vs, ua, us can be car- 
ried out at once. The limitation on the 1, ve, vs 
and u, ua, us being 


spin factors. We shall denote the distance of the Verve yt ys te? + ys? Sv? 
two electrons by 7, and may assume that the line | »=(3n/8r)} (4) 
joining them lies in the X direction, since the it gives 

Pin= E ZZ rore) (P p1 eri rL), (5) 


rı=—y py=—y 


Now the summation over », and p, after dividing by r?v?(4»?—1)?/9 gives for the probability of the 
electrons with parallel spin being a distance r apart 


? There is nothing like exchange forces in our picture. 


8 This consideration is contained implicitly in the work of Uhlenbeck and Gropper for the case of only slightly 
degenerated gases, Phys. Rev. 40, 1029 (1932). We are interested in the case of complete degeneracy, however. 
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Here d’=wt/3trt! and u=L'/n is the atomic 


volume. The function P(r) is sketched in Fig. 2, 
it vanishes for r=0 and approaches 1 as r be- 
comes large compared with the lattice constant. 
It attains its half-value for r=1.79 d’ or 0.460 d 
for a body-centered lattice with a cube edge d. 
The radius of the sphere described above is 
about the same, namely (3r)!d=0.492 d. So we 
see that two electrons with parallel spin will be 
very rarely at the same ion, simply in conse- 
quence of the exclusion principle. This will be 
also true for a Fermi gas subject to a periodic 
potential, as the potential does not materially 
alter this argument. 

It remains to investigate the case of anti- 
parallel spins somewhat more closely. As it is not 
possible that three electrons have antiparallel 
spins, the probability of three electrons being at 
the same ion will be very small anyway. For two 
electrons with antiparallel spins and without 
interaction, however, there is no statistical con- 
nection of the positions. If one should take the 
interaction into account, it would turn out, how- 
ever, that there is a connection of such a kind 
that they are but rarely in the neighborhood of 
each other. This is already indicated in the well- 
known solution of Hylleraas® for He and in the 
similar solution of Bethe!® for the negative hydro- 
gen ion. These solutions also show that the con- 
nection is of such an order of magnitude that the 
choice of potential function, which corresponds 


3 cos (rr/L) sin (2rvr/L) —2r sin (rr/L) cos (2rvr/L) | 
v(4v?—1) sin? (rr/L) ) 


1 (3 
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sin (r/d’) —(r/d’) cos (r/d’)\ ? (6) 
(r/d) ) 

to our rather rough picture of the metallic bond, 
is justified to some extent. It must be admitted, 
however, that the lower limit of the energies of 
the free electrons, which we calculate in this way, 
will certainly give too large a binding energy for 
the lattice, as all of the electrons will not be at 
different ions with certainty, and also because 
the terms of Hylleraas and Bethe just discussed 
will increase the kinetic energy above the value 
which we obtain by our boundary conditions. We 
shall not take up this question in more detail 
this time as it is deeply connected with the inter- 
action problem of the electron and the justifica- 
tion of the notions of the free and bound electrons 
and we hope to return to it at another time. 


IV. 


The calculation of the wave function inside the 
proper spheres of the ions is very simple in 
principle. The potential function of Prokofjew!! 
was used for the purpose. This was obtained by 
Prokofjew following a method of Kramers! in 
which one employs experimental values of the 
terms of Na. The differential equation for the 
radial function R=r/(r) is 


— (h?/82?m)(2R/dr?) + V(r)R=ER(r) (7) 


and in units of the Bohr radius of H, the quantity 
QO(p) = —aop?V /e?, is approximated for various 
intervals by parabolas, as follows: 


p=0.00 to 0.01 Q=1I1p 


0.01 0.15 =—26.4 p?+11.53 p—0.00264 
0.15 1.00 =— 2.84 p+ 4.46 p+0.5275 
1.00 1.55 = -+ 1.508 p?— 4.236 p+4.876 
1.55 3.30 = 0.1196p?+ 0.2072p+1.319 
3.30 6.74 = 0.0005p?+ 0.9933p+0.0222. 
6.74 œ =p 


The boundary condition dy/är=0 requires that at the boundary R should satisfy 
dR/dr=R/r. (8) 


° E. A. Hylleraas, Zeits. f. Physik 48, 469 (1928). 
10 H. Bethe, Zeits. f. Physik 57, 815 (1929). 


n W, Prokofjew, Zeits. f. Physik 58, 255 (1929). (Note: 
Prokofjew’s table of Q(e) (p. 258) contains obvious errors 


in two places, one in decimal point and one in sign. These 
were easily detected by the continuity conditions. The 
form given here is corrected.) 

2H. A. Kramers, Zeits. f. Physik 39, 828 (1926). 
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P(r) 
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Instead of finding the energy value £ for different 
radii of the sphere, the radii corresponding to 
different energy values were determined. Thus an 
arbitrary energy value E was taken and the cor- 
responding wave function was obtained from (7) 
using the method of finite differences employed 
by Prokofjew. The calculation was started at 
r=0, however, so that every calculated wave 
function could be used. For r=0 the radial func- 
tion R vanishes and the solution up to 7=0.025 
was calculated by means of a power series in 7”. 
After this the method of finite differences was 
employed, first with differences of 0.005 and then 
with larger ones when allowable. The largest dif- 
ference employed was 0.32 (r>4.6). The wave 
function had practically no dependence on E in 
the neighborhood of the origin so that it was not 
always necessary to repeat this part of the calcu- 
lation. As a check, the energy of the electron in 
the free atom was also determined, the calculated 
value lies between 0.3820. and 0.3800 Rydberg 
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units, while the experimental value is 0.3778. In 
Fig. 3, the wave function of the free atom and the 
wave function for E=0.500 are plotted. The 
numerical tables will be published at another 
time. 

_ After having the wave function, it was easy to 
determine the radius of the sphere, for which the 
boundary condition (8) is satisfied by drawing 
the tangents to R from the origin. The figure 
shows, that the boundary conditions are satisfied 
for two different radii, so that every numerical 
integration yields two points of the E(r) curve, 
which gives the energy of the most strongly 
bound free electron as a function of the lattice 
constant d= (87/3)ir. In Fig. 4 the E(r) curve is 
given (lower line), the unit of energy being the 
ionization energy of H. For very large r it ap- 
proaches the ionization energy of atomic Na, 
possesses a Minimum around r= 3, and rises again 
for smaller values of 7. This latter behavior is due 
to the fact that a further compression of the lat- 
tice would push the valence electron inside the 
closed Z shell, which of course requires energy. 
The lattice, unlike a similar H lattice,!? would be 
stable, therefore, even without taking into ac- 
count Fermi statistics. 

The calculation of a wave function took about 
two afternoons, and five wave functions were 
calculated on the whole, giving the ten points of 


18 Cf, E. A. Hylleraas, reference 3. 
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the figure. The points of the wave function of 
Fig. 4 are marked by a cross. 

Another point which should be mentioned is 
that concerning the change in energy of the inner 
shells. The change is not due chiefly to the change 
of the boundary conditions as discussed above 
for the valence electron. This, of course, does in- 
crease the binding energy of these electrons, but 
only by a very small amount. A greater change 
arises from the increase in the probability of the 
valence electron being inside the L-electrons, 
because of the material change of the normaliza- 
tion factor. This decreases the binding energy of 
the inner electrons and hence lowers the heat of 
vaporization. A calculation of this effect has been 
made and shows that the decrease in binding 
energy is 0.008 Rydberg units per atom, or 2500 
small calories per mole. This was obtained by 
evaluating the change of the potential of the 
inner electrons in the field due to the valence 
electron for the free and bound atom. The inner 
charge distribution was taken to be that given 
by Hartree.* 

V. 


The last question we have to investigate is 
concerned with the additional energy of the other 
free electrons due to the Fermi distribution. This 
energy was calculated by the simple Fermi 
formula and it gives a mean additional energy for 
every electron 


a 1 OW é a1 
10m \8r Vi 80rm =) r? 
or (97/10) (3/27) 7? if the energy is measured in 


Rydberg units and r in Bohr units. As a matter of 
fact, this formula is valid only for free electrons 


* Hartree, Proc. Camb. Phil. Soc. 24, 111 (1927). 


(9) 
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and it certainly gives too large a value for bound 
electrons. The fact that the energy differences for. 
bound electrons are smaller than for free electrons 
was shown first by Bloch.!t It also follows from 
the following argument. Let the wave function of 
the electron with the lowest energy be yon 
(x, y, z), which is invariant with respect to an 
addition of an identity period to the coordinates. 
This invariance is not possessed by the wave 
functions of the other free electrons, and that 
with the quantum numbers rı, vz vs, will be 
multiplied by ertrıd/L, garind/Z and by etriradiZ if 
x, y, or z are increased by d, respectively. Now 
Yooo gives the lowest possible energy of all wave 
functions, which are orthogonal to the wave 
functions of the L and K shells. For the wave 
function with the quantum numbers 1, v2, v3 this 
is true if we compare it only with functions 
which have the same symmetry character, i.e., 
are multiplied by the same factors if one re- 
places x, y or z by x +d, y+d or z+d, respectively. 
There is, however, the function 


Yr var T erin et aye r32) TY oog(x, Yı 2), (10) 


which has all the required properties and the 
energy of which differs from that of Yoo only by 
(2/2mL?) (vı?+va?+v3?), the Fermi energy for free 
electrons. This is easily seen upon calculating 
(Waran Hervan) for (10) and remembering 
that Yoo may be assumed to be real. The energy 
of the real wave function with the quantum 
numbers vı, v2, vs is certainly less than that of 
(10) and so the average additional energy for the 
free electrons in higher states is also certainly less 
than (9). Nevertheless (9) was adopted in the 
subsequent calculation and the corresponding 
expression added to the energy of the lowest 
electron E(r). The result is given in Fig. 4 
(upper line). It is probably true that the fact that 
(9) gives a too high value largely compensates 
the error which was made by the assumption of 
the free electrons, as discussed in Section ITI. 


VI. 


The upper curve in Fig. 4 gives at once all 
quantities we desire to calculate. The posi- 
tion of the minimum gives the radius of the 
sphere for which the energy is the smallest and 


14 F, Bloch, Zeits. f. Physik. 
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when multiplied by (87/3)! it should give the 
lattice constant for the absolute zero point. 
Similarly the depth of the minimum below the 


line of the energy for the free atom should give, _ 


after subtracting the correction for the energy 
gained by the inner shells (2.5 kilo cal.) the 
energy difference between the gas and the solid 
state (i.e., the heat of vaporization per atom) in 
Rydberg units at the absolute zero point. Finally 
the curvature at the minimum fm is in a simple 
connection with the compressibility: the energy 
change for a linear compression in the ratio a per 
volume v is vor (3&)?/2, where « is the compressi- 
bility, and, on the other hand, itis 3 %_,20?d?E (tm) /dr? 
when calculated from the figure. This gives 


k= (1/9) (%m?/ Vo) (PE(rm)/d1*). (11) 


The quantity d?E/dr? was calculated as if the 
lower curve were linear at 7,, and all the curva- 
ture arose from (9), which is approximately true 
according to the figure. The final values obtained 
for the three quantities d, A and x are d=4.2A, 
\=25.6 kilo cal./mol, «=1.6X10-" c.g.s. The 
depth of the minimum of the lower curve below 
the energy of the free atom is 88.5 kilo cal.; 
and the depth at the point where the upper 
curve has its minimum is 70.4. The Fermi 
correction at this point is 42.3 kilo cal. In 
order to have a fair comparison with experiment, 
the experimental values for. these quantities must 
be extrapolated to the absolute zero point. It 
must be remembered, however, that we did not 
treat the motion of the nuclei by quantum me- 
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chanics and in consequence, the extrapolation 
should be done in such a way as to neglect the 
quantum effects. This was done by taking the 
values for room temperature and correcting them 
linearly. The three values for room temperatures 
arel5: d=4.30A; \=26.00 kilo cal./mol; «=1.67 
X10-" c.g.s. The coefficient of thermal expan; 
sion is 6210-5; the corrected value of X ig 
determined by adding the difference in heat 
content of solid and gas ((6—3) cal./deg. x 300 
deg. =900 cal.) to the room temperature value 


.of X; and the value of x at 0°K was obtained 


by extrapolating values given along with the 
above. The final values are: d=4.23A, \=26.9 
kilo cal./mol; «~1.0*10-" c.g.s. The theo- 
retical values compare favorably with these, 
partly, without doubt, as a consequence of com- 
pensating errors. 

The work.on sodium is being extended with 
particular reference to a more exact determina- 
tion of the distribution of energy levels in the 
neighborhood of the lowest one. Moreover, the 
corresponding calculations on Li, K, Rb, by 
using Hartree’s and Hargreaves’ fields,” are 
being undertaken by one of us. ) 


16d: P. P. Ewald, Hand. d. Phys. XXIV, 331. 
`: J. Sherman, Chem. Rev. 11, 93 (1932). 
_«t Landolt Bornstein, Erster Ergaenzungsband 5 auf., 
25. Int. Crit. Tables III, 47. 

16 Int. Crit. Tables II, 461. 

17 We wish to offer our thanks, at this time, to Professor 
J. C. Slater and hence to Dr. Hartree, for the use of the 
unpublished tables of Kt. 
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The present work represents an extension to a previous 
development by the same authors, on the theory of metallic 
sodium. In the first part of this paper a completely self- 
consistent solution of Fock’s equations for the sodium 
lattice is carried through indirectly, this being the approxi- 
mation in which one-electron functions are employed. 
The question of the correlations between electrons with 
parallel spin is investigated quantitatively and the Fermi 
“zero-point energy” is calculated using the proper effective 
field. The results show that the electrons behave almost 


I. THE POTENTIAL INSIDE THE LATTICE 


In a previous paper by the same authors a 
method of calculating the binding properties of 
metals was developed! and applied to sodium. 
The procedure employed was essentially one of 
solving the Fock system of equations? for the 
valence electrons (i.e., the system of equations 
to which the Schrédinger equation reduces when 
one electron functions are assumed). This solu- 
tion did not proceed from a formal investigation 
of Fock’s differential equations, but was devel- 
oped indirectly under the guiding principles of 
the picture afforded by the free electron theory. 

To begin with, the lattice was subdivided into 
polyhedrons of equal size and form, which we 
shall call s-polyhedrons, each of which surrounds 
one ion lying in its center, and is bounded by 
the planes which bisect, perpendicularly, the 
lines connecting the corresponding ion with its 
14 neighbors (the alkali metals form body 
centered lattices). Since these polyhedrons 
closely resemble spheres, they may be replaced 
by spheres of equal volume for many purposes 
and these we shall designate as s-spheres, their 
radius being r,= (3u)/47)?, where vo is the atomic 
volume. 

Concerning the nature of the electronic states 
of the lattice, we know that there will be bands 
of allowed levels, no more than two electrons 


2 E, Wigner and F. Seitz, Phys. Rev. 43, 804 (1933). 
2 J.C. Slater, Phys. Rev. 35, 210 (1930); V. Fock, Zeits. f. 
Physik 61, 126 (1930), 


exactly as if they were entirely free, the binding energy 
being 9 kg cal and the lattice constant 4.86A, as compared 
with the observed values 26.9 kg cal and 4.23A. To com- 
plete the picture, the correlations between electrons with 
anti-parallel spins are investigated in the latter part, since 
these are not included in the Fock picture. A general 
discussion of this question is presented and a quantitative 
treatment of its effect is made which yields a new binding 
energy of 23.2 kg cal and a lattice constant of 4.75A. The 
source of the remaining discrepancy is discussed. 


occupying each level because of the restrictions 
imposed by the Pauli principle, and that the 
lowest state in the lowest band will possess the 
symmetry of the lattice. From this it follows 
that its normal derivative will vanish on the 
boundaries of the s-polyhedrons and to obtain 
its wave function, it is only necessary to solve the 
Schrédinger equation within one polyhedron, by 
using a suitable effective field and this boundary 
condition. Approximate wave functions of other 
electrons may be obtained from this by multi- 
plying it with factors of the form 


errilnztveytvaz)/D, (1) 


where vi, v», v3 are positive or negative integers 
and L is the length of the crystal-edge, corre- 
sponding to free electrons with nonvanishing 
eigenvalues of momentum. _ 

In order to find the effective potential field 
inside of an s-polyhedron, we first replace them 
by s-spheres, correcting later for the small error 
arising from this. For an electron at a given 
point the field consists of three parts: first, the 
potential arising from the ion at the center of 
the s-sphere, second, the potential of the other 
ions; and third, the potential arising from the 
other free electrons. Instead of making a direct 
calculation of each of these in the order given it 
is found far more advantageous to begin with 
an investigation of the third for it happens that 
simplifying assumptions concerning the nature 
of this interaction may be fully justified at a later 
stage and allow a simple treatment of the first 
two. 
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The potential of the electron under considera- 
tion arising from the other electrons is equal to 
that arising from a continuous electron fluid of 
density which is e times the probability of having 
an electron at the corresponding point of the 
lattice. If we assume that all wave functions 
arise from one by multiplication with a factor of 
the form (1), the probability distribution will be 
the same for each. Moreover, since it will be 
spherically symmetric within each sphere the 
effective field of each ion will be neutralized 
outside of the sphere in which it is contained, and 
the only field which remains to be taken into 
account for our chosen electron is that arising 
from the charge within the s-sphere which con- 
tains it. Since this will give rise to a spherically 


symmetric field and a spherically-symmetric | 


wave function, the assumption of spherical sym- 
metry is completely self-consistent. The error 
introduced by’ replacing the s-polyhedrons by 
spheres is calculated in appendices 1 and 2 and 
turns out to be negligibly small. 

In the previous work only the field arising from 
the ion in each s-sphere was taken into account 
and it was assumed that correlations were such 

„that other electrons did not penetrate the given 
sphere. Under this assumption, the wave function 
“turned out to be practically constant for more 
-than 90 percent of the volume of the sphere 
(cf. Fig. 1), a fact which allows us to take the 
other electrons into account very simply and 
correct for the inaccuracies involved. For if we 
now introduce the assumption that all of the 
electrons have the distribution given by wooo of 


2 3 4 
r (Bohr units) 


Fic. 1. Metallic wave function for lowest electronic state. 
The corresponding energy is —0.6 Rydberg units and the 
proper boundary condition is satisfied at 4.04 Bohr units. 
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Fig. 1 (with factors (1)), it is found that this set 
of one-electron functions forms a practically self- 
consistent solution of Fock’s equations for the 
lattice. 

The potential V(x, y, z) which enters into the 
wave equation of .an electron in Fock’s system 
is not the ordinary average of the potentials of 
the other particles for all configurations, but the 
average of the potentials with each configuration 
taken with the weight it has under the assump- 
tion that the considered electron is in the point 
x, y, 2. We have seen Eq. (6) that there is a hole 
in the otherwise uniform electron fluid around 
every electron because the probability of two 
electrons having parallel spin being very near is 
very small. We shall call this the Fermi hole. 
Its effect is such as to make the potential more 
negative than it would otherwise be. Since its 
shape is practically independent of the position 
of the electron, the change of the potential 
arising from it is constant and it alters the energy 
but not the wave function. The previous work 
merely gave an estimate of this change and we 
shall give a more accurate calculation here. 

Because the wave functions of electrons of 
higher energy are given by multiplying Yooo by 
(1) only in the case of free electrons (i.e., when 
Yoo is constant), we have investigated this very 
important point in more detail (cf. Section II), 
and have found that the energy differences 
between the different states is the same as that 
obtained when the electrons are free. The wave 
functions are not simply yoo multiplied by (1), 
however, but each is multiplied in addition by a 
factor which is nearly constant within a sphere 
one Bohr unit in radius but becomes exponential 
outside. 

In all of the work sketched in the previous 
paragraphs we have been able to restrict our- 
selves to the one electron picture, that is, the 
wave function for all of the electrons may be 
assumed to be a determinant of single electron 
functions. This leads to correlations between 
electrons of parallel spin of the type expressed 
by the Fermi holes and to none whatever 
between electrons of antiparallel spin. There are 
such correlations, however, arising from the 
mutual repulsion terms, but they lie beyond the 
scope of Fock’s equations. We shall call these. 
holes ‘‘correlation holes” and the corresponding 
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energy “correlation energy,” and shall attempt 
to handle them in Section ITI. 


2. 


Under the assumption of a uniform charge 
distribution the potential within the s-sphere at 
the distance r from the ion is 


V(r) + (3e?/2r,) — (er? /2r,°), (2) 


where the first term arises from the ion, and the 
other two from the electron fluid inside the 
s-polyhedron. .In reference 1 the wave equation 
for the potential V(r) alone—which was taken 
from Prokofjew’s work®—was solved and the 
corresponding characteristic value determined. 
The correction for the fact that this is not exactly 
the energy of the corresponding wave function 
is taken care of in Appendix 1. The second term 
in (2) is independent of 7, leaving the wave 
function unchanged, and gives a positive contri- 
bution 


Fs = 3e?/4r, (3) 


to the energy. A factor 4 enters, since the inter- 
actions between electrons have to be counted 
only once. 

In order to take into account the last term, 
the Schrödinger perturbation theory may be 
applied and this yields 


e 
Ex = a( = 2r, f Vra 
et (S wyirdv)? 
SR, oy 


4r T E-E, 


up to the second approximation where y and E 
are the solutions of the wave equation with the 
potential V(r) alone, which we know from 
reference 1. The factor % arises for the same 
reason given before. To determine the first part 
of (4) we have numerically evaluated the integral 
over yr? for three different values of the lattice 
constant, i.e., 7, and the corresponding energy. 
The results are tabulated in Table I, the energy 


TABLE I. 
E f, Svrdv f 
— 0.60 4.05 9.90 0.994 
—0.55 4.74 12.95 0.962 
—0.50 5.48 16.50 0.916 


3? W. Prokofjew, Zeits. f. Physik 58, 255 (1929). 
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being given in Rydberg units, r, in Bohr units, 
and the integral in squares of Bohr units. For 
the actual lattice constant r,=4, the integral 
has the same value that it would have if y were 
a constant. The last column gives the ratio f of 
the numerically calculated integral to that calcu- 
lated with a constant y for the other cases. 

In the second part of (4) the denominator may 
be replaced by a mean value E— E, and the 
summation in the numerator gives JS Yr'du 
—(SWrdv)? in the usual way.‘ By taking y to 
be constant for this small term one obtains 


ais (2 9 
ER 56 200 


= —0.005e/r. (5 
Jra (8) 


The mean value of E, has been estimated to be 
+0.3 Rydberg units. The effect of the perturbing 
terms in (2) on the wave functions can safely be 
said to be extremely small because of the small- 
ness of (5), and their total contribution to the 
energy is 


Fre = (— 0.15f— 0.005)e?/r,. (6) 


The fact that we must deal with the s-polyhedron 
instead of s-sphere is taken into account in Ap- 
pendix 2. 


3. 


In the preceding section we have calculated 
using the ordinary average potential. We know, 
however, that in order to calculate the potential 
of an electron at a certain point we should aver- 
age over the configuration of the other electrons 
with the weights which they would have if the 
electron under consideration were at the given 
point. It was shown in reference 1 that if there is 
an electron at a given point the probability of 
another electron being at a distance 7 from it is 


e sin (r/d’)—(r/d’) cos (r/d’)\ ? 
a ee Sp 07 
tas a I 


The factor $ expresses the fact that only the 
distribution of charge with parallel spin is af- 
fected, v is the atomic volume and d’ = (w/3r)t. 
(7) was derived under the assumption that Wooo 
is constant and that the higher ones are obtained 
from it by multiplication with (1). Both as- 
sumptions will be shown to be practically correct 


* A. Unsöld, Zeits. f. Physik 43, 563 (1927). 
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insofar as energies are concerned. The total 
amount of charge removed because of the hole 
expressed by (7) is exactly e and may be viewed 
as being concentrated in the electron at the 
center of the hole. To this approximation it has 


E| 
2 vo 0 7 


Since this energy’ does not depend on the 
position of the electron, it is to be carried 
forward at once as an energy with a factor 4, in 
order that the interactions between pairs of 
electrons is taken into account only once. We 
have, therefore, 


Eg = —0.458e?/r,. 


(r/d)? 


(9) 


If the wave function were not entirely flat in the 
lowest state, (7) should be more rigorously 
multiplied by voyo. The integral (8) will hardly 
be affected by this, however, because the regions 
in which yoo changes appreciably are small 
compared to the distance in which (7) has 
appreciable variation and the mean value of (8) 
for different positions of the electron will differ 
from the calculated value by even a smaller 
percentage. 

The sum Eu+En+Er from the viewpoint of 
Fock’s equation, gives the effect arising from the 
fact that the potential at any point is not exactly 
V(r) (where r is the distance from the nearest 
ion), as it would be if the hole in the electron 
fluid of the other electrons extended exactly over 
the s-sphere. It really surrounds (cf. Fig. 2) the 
electron under consideration spherically in such 
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been assumed that all of this hole is in the 
parallel charge distribution—an assumption not 
exactly correct, however. 

The decrease in the potential energy of the 
electron at the center of the hole is 


34/362 33/3 


2r 287287, 


(8) 


a way that part (A) of the s-sphere is left 
uncovered. On the other hand, it extends over 
regions (C) outside the s-sphere. Now Es repre- 
sents the effect of the entire hole (B+C), while 
En+ Er represents the effect of the lack of hole 
in the regions A and B. The fact that we made 
all calculations under the assumption that the 
electron density is constant, will not greatly 
affect our final result, since in the largest region, 
B, ıt does not matter which density we use, as 
long as we use the same density in both cases. 


Fic. 2. Schematic diagram of prominent regions in the 
unit cell. 


An error will be introduced only by using an 
improper density in A and C—but here yoo is 
very nearly constant. 


Il. Tue FERMI ENERGY 


4. 


Our next task is the calculation of the zero-point energy of the free electrons. First of all, an 


error in reference 1 should be corrected, for it was stated there that the zero-point energy is actually 
smaller than it would be for free electrons. It was observed that Yr»: = Yo exp [2mi(vıc+ rey 
-+»32)/L] possessed the proper transformation properties for a state with the quantum numbers 
v, v9, v3 (the ‘‘components of momentum” being’ 2rAvı/L, 2rħv:/L, 2mhvs/L). Moreover its mean 
energy was higher than that of Yoo by the amount 277h?(»?+ ».?-+- v3?) which is just the Fermi cor- 


5 This formula was first found by F. Bloch, Zeits, f. 
Physik 57, 545 (1929). 

6 h in this paper is Dirac’s k, namely, Planck’s constant 
divided by 27. We would like to correct another error of 
reference 1 here. The subtraction of 2500 cal. from the 
binding energy at the end of Section IV is incorrect. The 


fact that the binding energy of the inner shells decreases 
with increasing binding of the valence electrons does not 
appear in the Prokofjew picture because the potential 
energy of the valence electron is taken to be pV instead of 
ipV, so that it is not necessary to consider the change 
of the potential acting on the other particles. 
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rection for an entirely free electron. As the real y+»; is the solution of a minimum problem, it was 
concluded that its energy is even lower than that of W125. 

This argument is not correct, however, since WY», has the lowest energy of only those states 
which are orthogonal to the states of the K and L shells. This is correct for Yos but not for Wrives 
and the real Yr»; should be calculated just as Yon was—by solving a differential equation. 

At any rate, it can be assumed from the transformation property of Yi»; that it has the form 


Yon; = Xros EXP [2rilrix+ vey t+ vgz)/L ], (10) 


where xr»: has the translational periods of the lattice. By inserting (10) into the Schrödinger 
equation, one obtains 


Xrive3 = FrwwsxXvivarı, 


H Xros + 


2rih? ð ð ð 
( Vy— + v2 —+ vs — 

Ox oy 02 

Erv: = Friwo + (2 wh? /mL?) (v2 + pet v3). 


If we neglect the second term on the left side to begin with, the equation is identical with the equation 
for Yow, which we shall denote by y in this section, the other solutions of (11) with uen=,=0 
being designated by y.. To this approximation Ess = Egoo+ 27? h? /mL?( v1? +v + v) which is just 
the result of reference 1. 

We shall take into account the second term of (11) by the Rayleigh-Schrédinger perturbation 
method. The first approximation gives zero, since / yoðyo/ðx vanishes, and the second approximation 
will be proportional to the »’s. From considerations of the s-polyhedron, the boundary conditions 
were found to be such that the value and derivative of the wave functions should be equal in the 
two points at which a line perpendicular to the boundary plane cuts the boundary. We know further- 
more from group theory that all solutions y, of the unperturbed problem are either even or odd 
functions of x, y, z. Now if we calculate the second approximation for the energy, we need integrals 
of the type SYody./dx, and since yy is an even function of the coordinates, Y, must be even in y, 2 
and odd in x, or the integral vanishes. Therefore, all three perturbing terms in (11) will give non- 
vanishing matrix elements only with different y.. The second approximation for the energy is, 
therefore 


m 


(11) 


4th? | Sv. dyo/ dx |? 
= 2 2 2 Ps ec ae 
Fries; mL? (vi + v2? + vs Le E,-E. ’ (12) 


since the integral with the y and z-derivatives can be replaced by integrals with x derivatives, E, is 
the energy of the state y.. The sum in (12) contains one positive term (arising from the 22 level) 
and all other terms are negative. Since the value of the positive term is very sensitive to the actual 
shape of the wave function Yo, we tried to transform (12) in a form more suitable for the calculation 
in the following way. 

If yo and Y, were the characteristic functions of an atom, (12) could be evaluated by the Thomas- 
Kuhn sum rule and would give just the negative Fermi correction, so that the sum of the first and 
second approximation would be zero. In the present case of periodic boundary conditions, however, 
the partial integrations cannot be performed in the same way. We obtain from the Schrédinger 
equation for y, by multiplication with xp) and subtracting the corresponding equation for yọ multi- 


plied by xy, 


h? h? Io 
(E Es) | bate= -— | Eeoa ved obo) ]—— |. (13) 
2m m Ox 
The first term can be transformed by Green’s theorem into a surface integral and we have 
Ovo m(Eo -E,) Ir d(x) 
foa = E (var f (= — vx yar, (14) 
Ox h? ðn ðn 
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where 0/0” denotes the normal derivative. For the usual boundary conditions, the last term in 
(14) would vanish. After replacing one of the integrals in (12) by the expression (14), the summation 


over x can be carried out in the first term by the completeness-relation’ and yields 


(41H mL ret rtd) f xbolavo/ dado as) 


Since yo is a function of r alone, we have 


ô po 2r Ts Ove 

fea = — r——dv = 3voYolr.)? — 3. 

Ox 3 0 Or 

The —# gives the negative Fermi correction and cancels the first approximation. The first term, 
however, gives an expression very similar to the Fermi correction, namely 


2 


(v +r? + v3")voWo(re)s (16) 
mL? 
the difference being that the value of Yow at the boundary enters instead of the mean value. 
The second term of (14), inserted into (12), gives 
2r’h‘ S (Y. OYo/dx)dv OW. 
= mtrt E I feas, (17) 
m?L? x Er-E. on | 


since all odd y, vanish at the boundary. To this sum the 2p state does not contribute anything 
more, because its wave function is practically zero outside r=3. Similarly we found upon direct 
computation, that the integral /'Y.dy,/dxdd vanishes “‘accidentally” for r,=4 for the next higher 
odd state. The energy of the next is of the order of magnitude +5 Rydberg units and we have 
replaced E, in the denominators of (17) by a mean value E, of this magnitude. Now /y,0yo/dxd0 
is the expansion coefficient of y, in the series for dy,/dx so that the summation can be carried out 
in the numerator and gives /xyo(d/dn) (dyo/dx)df. Since Yo is a function of r alone, (17) can further 
be transformed into 

rhs 4r yolr.) 
(v? + vo? + v3?) —r,*Po(r,) N 
3 ðr? 


(18) 


mL? 


Here 0°y)/dr? can be calculated from Schrédinger’s equation and we obtain for the entire zero-point 
energy the sum of (16) and (18) 


Enis — Eww = (20? h2/mL?) (n+ v2? + vs") vopo(re)?L 1 +2(Eo— V(r.) /(Eo— Ex) J. (19) 


This equation shows that the ratio of the actual second term of (19) is negligible, and several 


zero-point energy of the electrons to the zero 
point energy of free electrons is 


vovo(rs)*[1-+2(Eo— Vlr)/(Eo-Er)] (19a) 


can be larger than 1. It approaches zero for 
very large r, but increases greatly with decreasing 
r, when the pressing of the valence electrons 
into the inner shells becomes appreciable. The 

? This is exactly London’s reasoning (Zeits. f. Physik 39, 
322 (1926)) with the only difference that the partial inte- 


gration must be carried through more carefully than in his 
case. 


values of the first are given in the table below: 


3.67 
1.08 


4.05 
0.99 


4,74 
0.89 


5.48 


Ye 
voWo(rs)? 0.76 


Once again, it is found that Yoo behaves at r=4 
as if it were constant. Moreover it is remarkable 
that the second term in (19a) increases the 
Fermi energy over the classical value by a very 
small amount, if the potential is taken as arising 
entirely from the ion, and it decreases it some- 
what below that value if yo is supposed to be 
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the solution of a Schrédinger equation with the 
potential (2). 


5. 

If one plots the Fermi energy against (v? 
-+v2+v3)? Eq. (19) should give the curvature 
of the plot at the zero point. It would not be 
right, however, to assume that it is correct for 
all v. As a matter of fact we know from the work 
of F. Bloch® that for higher » it may depend upon 
the ratios vı : v2 : v3 and from the work of P. M. 
Morse, R. Peierls® and especially L. Brillouin!’ 
that it is not a continuous function of the v’s 
everywhere but has discontinuities corresponding 
to wave-lengths for which the Bragg-Laue con- 
ditions of x-ray reflections are fulfilled. 

According to Brillouin this discontinuity in 
the space of the v’s is on the rhombdodecahedron 
tn=+L/d, ntvg=AtL/d, vetvy=tL/d 
where d is the lattice constant in the space of the 
y's. In order to obtain an idea of the size of these 
discontinuities, we have taken three points of this 
rhombdodecahedron, corresponding to the three 
directions (100), (110), (111) and tried to obtain 


the two energy values for each of them, the 


transformation properties of the corresponding 
waves being determined for the symmetry ele- 
ments of the space group. 

It was an easy matter to find a function which 
had the proper transformation properties within 
the s-polyhedron but it seemed difficult for us to 
satisfy the continuity conditions at the boundary. 
The procedure finally adopted possesses no 
rigorous justification, but it was decided most 
reasonable to select wave functions which were 
continuous only at the midpoint of the fourteen 
boundary planes, with linear combinations of s, 
b, d, f and g functions. For the point n=» 
=L/2d, v3=0 ((110) direction) it was sufficient 
to take a linear combination of the s, d and g 
functions for the lowest function at the discon- 
tinuity and a p function for the upper. For the 
(111) direction the lower was represented by a 
linear combination of p and d functions and the 
upper by a combination of s, f and g while for the 
(100) direction the lower part was a linear com- 


8 F., Bloch, Zeits. f. Physik 52, 555 (1928). 

? P. M. Morse, Phys. Rev. 35, 1310 (1930); R. Peierls, 
Ann. d. Physik 4, 121 (1930). 
a ae Die Quantenstatistik, Berlin 1931, p. 
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Energy (Rydberg units) 
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Fic. 3. Graph showing the energy as a function of 
(v,2-+y2?-++3)!d/L for wave functions of electrons ‘moving’ 
in three prominent crystallographic directions. 


bination of p and d functions and the upper was 
a d function. The results for the observed lattice 
constants are shown in Fig. 3 in which the 
abscissa is (v?+»22-+»32)!d/L and the energy is 
in Rydberg units. The parabola corresponds to 
free electrons; the heavy section representing 
the extent to which electrons would occupy the 
levels, and the other three curves correspond to 
the -designated direction vı : ve: v}. In accord 
with the remarks in previous paragraphs that 
the electrons behave almost as if free, the 
discontinuities are not very pronounced. The 
calculations leading to Fig. 3 do not justify an 
accuracy greater than 0.05 Rydberg units for 
absolute values of energy, although the energy 
differences at the discontinuities are probably 
more accurate. 

In a recent paper which has appeared while 
this manuscript was in preparation, Slater! has 
investigated this phase of the problem from a 
viewpoint similar to that presented here. He 
has made a formal solution satisfying the proper 
boundary conditions at the centers of the eight 


hexagonal faces of the s-polyhedron, which 


u J, C. Slater, Phys. Rev. 45, 794 (1934). Cf. also H. 
10545 N. F. Mott, H. W. B. Skinner, Phys. Rev. 45, 379 
1 ; 
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requires a general function with eight arbitrary 
parameters that he chose to be a linear combi- 
nation of one s, three 9, three d and one f 
function each with arbitrary coefficients. This 
yields a secular equation of the eighth order 
which he solves in a degenerate case, namely, 
when the wave is traveling in a direction or- 
thogonal to one of the coordinate axes. For the 


case of the (110) direction, which he considers. 


in detail, the results agree, essentially, with those 
we have obtained and bear out our conclusions 
completely regarding. the accuracy with which 
the free electron picture represents facts. 


6. 
The unsatisfactory feature of the previous 
paragraph is that the wave functions employed 
are not continuous so that they do not possess 
an energy-value in a strict sense. It is evident, 
however, as Slater points out, that a continuous 
wave function would be obtained by superposing 
an infinite number of wave functions with differ- 
ent angular factors, which is also possible in the 
case of a constant potential, though it would be 
a rather awkward procedure, since we know the 
wave functions accurately. This would corre- 
spond to a development of e?***/¥ into a series 


erinit f(r) + Pi, Qf) HPO, ln), (20) 


where the P; are the spherical harmonics and 
the fı depend only on the distance from the 
center of the s-polyhedron and are essentially 
Bessel-functions. If we determine (20) in each 
‘s-polyhedron, the resulting functions will join 
each other continuously, since they all represent 
the same function e&r"=!Z, For the energy value 
of 0.15 Rydberg units which in the case of free 
electrons corresponds to vd/L=}, the quantities 
rfı'/fı are given at the surface of the s-sphere 
in the first line of the table below: 


s p d Í 

l 0 1 2 3 
Free electron — 0.9666 0.049 1.647 2.7 
Electron in lattice — 1.00 0.044 1.674 2.8 


The last line gives the same quantities for the 
Prokofjew-field of Na for the distance r,=4 and 
the energy value —0.45 Rydberg units, which 
lies 0.15 above the energy value —0.60 of the 
electron with the lowest energy. The first two 
numbers were obtained by graphical integration, 
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and the others are those for a simple Coulomb 
field, since the field of the Na ion does not give 
different values from those of a simple Coulomb 
field for d, f and higher terms. It is seen that the 
deviation of the two sets of numbers in the table 
are very small. Hence, if one adds the s, 2, d, 
etc., solutions of the Prokofjew field with such 
coefficients that the value of each function at 
r=r, is equal to the value of the corresponding 
function fı in-(20) for the same radius, one 
obtains a function within the s-polyhedron which 
will join similar functions in the other s-poly- 
hedra- continuously and even with practically 
continuous first derivatives. This shows that the 
energy of the waves for (v?-+v?+v2)id/L=} is 
higher than that of the wave »,=1,=»;=0 by 
the same amount (0.15 Rydberg units) in both 
cases. The same is true for all v’s which character- 
ize states occupied by electrons. 

If the numbers of the first line were smaller 
than those of the second, the Fermi energy would 
be smaller for free electrons than in the actual 
lattice. A more detailed numerical consideration 
shows that this is actually the case, though only 
to a very small extent. The situation seems to 
be opposite, however, if one takes into account 
the perturbation given in (3). The difference 
does not amount to more than 1 or 2 percent in 
either case, however, and we shall employ the 
numbers corresponding to free electrons in the 
following. 

Of course the coincidence of the two sets of 
numbers arises simply from the fact that the 
electrons in our picture behave very nearly like 
free electrons. We know that the optical proper- 
ties of metals can also be explained very well on 
this assumption,” so it possesses additional 
backing. 

When all the energy terms calculated in 
section 1 are added, one obtains the lower curve 
of. Fig. 4 for the energy of the bottom of the 
Fermi distribution as a function of the lattice 
constant, which is represented by the radius of 
the s-sphere. The upper curve contains the Fermi 
energy and gives the total energy of the electrons 
in the Fock picture. It yields a lattice constant 


_ of 4.76A and a binding energy of 9.0 cal. 


12: C, Zener, Nature 132, 968 (1933); R. de L. Kronig, 
Nature 133, 211 (1934). 
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Fic. 4. Energies as functions of r,. The lower curve is 
the energy of the lowest electronic state while the upper 
curve represents the energy of the entire lattice computed 
on the basis of the one-electron picture. 


III. CORRELATIONS BETWEEN ELECTRONS WITH 
ANTIPARALLEL SPIN 


7. 


In proceeding with a generalization of the 
foregoing sections to include electron correla- 
tions more general than those allowed by the 
one electron picture, we no longer have at hand 


Yılzı) 
W(x Xn, Mi Ya) = (1/2!) 
Vn(%1) 
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guiding principles that are as definite as those 
which we had there. Since the energy is to be 
minimized, the extensions to be made rest 
principally upon the possibility of forming a hole 
around every electron in the charge distribution 
of the other electrons. This possibility is, of 
course, given by the multi-dimensionality of the 
wave function, which allows a different proba- 
bility distribution of the other electrons for 
different positions of the one considered. Since 
such a hole is already present in the charge 
distribution of the electrons with parallel spin, 
we have principally to consider the possibility 
of a similar hole in the distribution of the elec- 
trons with antiparallel spin. These holes are not 
included in the one electron picture (cf. refer- 
ence 3), but we know them to exist in atomic 
eigenfunctions from the works of Bethe! and 
Hylieraas.!* They seem to play an even more 
important rôle in metals than in free atoms.!® 

The wave function employed in the previous 
sections for 2n electrons, may be written under 
neglect of the spin part, as!® 


Yilan) Yılyı) Yılyn) 
š ; F (21) 
Vnlan)| [Ya Yn(yn) 


Here x, represents the three Cartesian coordinates of the kth electron with upward spin and the y 
refer in a similar way to the electrons with downward spin. The functions Yı, ***, Ya, previously 
designated by yw, are the n lowest energy wave functions for an electron in the field of all the 
ions and the other electrons, regarded as a charge distribution. The task of the previous section, 
which we believe to be solved to a sufficient degree of accuracy, was to calculate the functions y 
and the energy of the total Y of (21) as functions of the lattice constant. 

A natural suggestion for the generalization of (21) is to include the coordinates of all 
electrons of antiparallel spin in y,(x,) as parameters. This function will then be a different 
function of x, for different values of yı, ***, Yn, and will have a minimum around every value of 
the set. We shall consider the effect on the function y, and the energies if we replace the continuous 
charge distribution of the electrons by point charges at y:---yn. This replacement will alter the 
field acting on the electron x and we shall use a perturbation method to calculate its consequences. 

We shall set | 


¥i(y1, "tts Yni X1) e Pan tts Yni Xn) Yılyı) ory, Via) 
sYa) = (1/7!) i ; pE i 
Vaiss Yni En) or Ynly cts Yni En) [Va ee Wale) 


13 H, Bethe, Zeits. f. Physik 57, 815 (1929). 
“E, A. Hylleraas, Zeits. f. Physik 48, 469 (1928). 


Y (xi, -et Xni Yu’ 


(22) 


due to the same configuration, also arises from the fact 
that the correlation energy is greater for singlets than for 


15 We are much indebted to Professor Slater for the re- 
mark that possibly the disagreement between experiment 
and his theory for the order and distance of atomic terms 


higher multiplets. 
= Oy Delbrück, Proc. Roy. Soc. (London) A129, 686 
(1930). 
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and denote by 
Folz, eet, Xni Yi °° +, Yn) =yılyı, ets Yni xı): 3 Ynlyı eer Ya; Xn) ly|, (22a) 


the part of (22) which is not yet antisymmetrized 
in the x’s, where |y| is the second determinant 
in (22). When no ambiguity arises, we shall 
designate y,(yi---yn, x) by y(x), in which it is 
to be understood that both x and » correspond 
to sets of three numbers. In addition, it will be 
assumed that both y,(91---yn, x) and its complex 
conjugate appear in the first determinant of (22) 
when this function is not real, that the same is 
true regarding y,(y) in |y|, and that the y,(yı 
+++”, .x) as functions of x are solutions of a 
characteristic value problem so that 


[won et, Yni X) We (yn, 9, Yn; x)dx 


=ô, (23) 
for all values of y. 
It is well known that the total energy of ¥ 


can be written in the form !7 
[fea-+x+ Vot V) Vodxdy, 


where K, and K, are the operators for the 
kinetic energy of the electrons of upward and 
downward spin respectively, Və is the old po- 
tential function calculated from the viewpoint 
in which the electrons are regarded as smeared 
out over the entire crystal, and V is the potential 
function required to account for the change 
incurred by viewing the y-electrons to be point 
charges with regard to their interaction with the 
x-electrons. The first three terms are those dealt 
with in the previous work and yield the wave 
function (21), while the last is a perturbing 
term and will yield a perturbation energy. If 
we retain the old wave functions in the neighbor- 
hood of the ions and replace them by a more 
general solution of (22) in regions where they 
behave like e!"#=/Z, the expectation value of Vo 
will be left unchanged since the electron density 
is changed nowhere, and in addition, it may be 
seen that (7) remains essentially true. The sum 
of the energy correction E and the Fermi energy 


17 Cf. e.g. E. Wigner, Gruppentheorie und ihre Anwend- 
ung. Braunschweig 1931. p. 323. 


F will be given, therefore, by the integral 


taken over the part of the configuration space 
in which the wave function is changed and 
which we shall assume to be the entire space.}® 
It is to bé noted that for an entirely flat wave 
function the expectation value of V, namely 


f f visasi 


is zero. We shall proceed with a calculation of 
the energy correction E. 


8. 


(25) 


To begin with, we shall transform the three 
terms of (24) to a more suitable form. In calcu- 
lating the expectation value of A?/2m-A, we 
can use (23) to carry out the integration over 
all of the remaining x's and obtain 


ke? | 
f f kAd Sy, f f y* 
2m n! 


XAzYyo|y|?dxdy. (26) 


If the same is done for — k?/2m-A,, the terms 
arising from the second derivative of the de- 
terminant are 


(Am? /2m) Dv. (27a) 


Those terms containing the derivatives of the 
Y,(yı° *"Yn, X») and of the determinant may be 
integrated over all of the x’s except x, and lead 
to expressions of the form 


g- Oy,(x,) 9 
ff; a)* Ly | -—]|y|dx,dy. 
Oy, 0% 


If this is added to the similar term containing 
the derivative of y,* it follows from (23) that 
the sum will vanish upon integration over x,. 
Finally, those terms which contain the second 
derivative of Y are of two types, namely, 


18 The difference of the two volumes is negligible. 
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— (R/Am)(1/n)L, f f V(x) *Ay p(x) |y |2dx,dy (27b) 
and 


— (i? /2m)(1/n )) 2 f f Cyle) pultu)" — Yallu)" Yale) *] grad nuy(xr) graduvyu(xy) |y|2dxdxdy. 


If ¥,¥,* we can add to the latter the term in which y,* replaces 4, and the sum will vanish upon 
integration over x. Hence the only terms resulting from the first term in the bracket are 


2 
—hQ/2m)1/n)Z f | f p(x)" gradun ¥(x,)dx,| | |%dy (27c) 
aad those from the second are such as to yield 
2 
—(#/amya/n E f | Sur" gradun Yo(a)de| Iyl’. (27d) 
nr 


If =v in (27d), we obtain (27c), so that the sum of (26), (27a, b, c, d) and 


(1/n DE f fiver 2 yni) |EV on + Yn; RLY |2dy 


leads to 


4r?y? 
2 


= )| ¥(x)dx| y|2dy, 


1 k2 
-E f fuo Vor ngaia) =È Atant Hant 
n! 2m 


2 
|yl?dx=E (28) 


—(e/am\(1/n EX f | [1.6 gradn wata) 


for the change in energy since 
F= (h?/2m)2>. (4r?v?/L?) 


is the Fermi energy. The summations over » and v are to be extended over all occupied states. 
In the following it will appear from the form given y¥,(y1° + +n, x) (i.e., see (29)) that the last ex- 
pression in (28), which could only lower the energy, vanishes. 

We shall proceed under the assumption that y,(yı* **Yn, x) has the form 


(91, ee re Yn; x) =errivz/Ly (x—Yı, eee R x—Ya)s (29) 


in which x, to the approximation considered here, is a constant plus a sum of functions, each of 
which depends on one argument x— yı. When substituted in (28) this yields 


E=5 f f xe" Vy <+, Yn; x) — (B?/2m)(2A.+(4ri/L)v-grad.)]V,- |y|*dxdy. (30) 


In order to minimize this x, must satisfy the equation 
a (R/2m)Ax,— (27th? /mL)»v- grad X+ Vina, ctt, Yn; x) X= &,Xo(Yı, Ya x). (31) 


Eq. (31) is much the same as (11), the principal difference being that the relative mass 3m 
appears instead of m. 
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The characteristic values e, of (31) will depend 
on the position yi--:y, of the electrons with 
downward spin, and we must average this 
characteristic value using |y|? as a weight- 
function to obtain a final value. It is possible to 
perform this general calculation to a sufficient 
degree of accuracy using a modification of the 
usual Rayleigh-Schrödinger perturbation process, 
since it does not converge in its usual form, 
but this involves a rather lengthy computation, 
so we shall content ourselves here with an 
estimate of the solution to (31) and present the 
more general solution at a later time. 


It is evident that if the y’s are to be most 


favorable for a large negative value of the 
correction energy. we shall want the regions in 
which no y occurs to be as large as possible. 
In these regions x,(y::°-yn, x) will have large 
maxima and the electron x will have the greatest 
probability of being as far as possible from the 
electrons of opposite spin. A secondary effect 
resulting from this will be an alteration of the 
original Fermi distribution |y|?, and, as a 
consequence, (8), but we shall not take this 
into account in the present calculation.'® 

The most disadvantageous configurations for 
the y's, that is those giving the smallest —e,(y 
-++y,), are the close-packed lattices, while the 
most advantageous is one in which all of the y 
electrons are at one corner-point of the lattice. 
This latter is practically impossible for the Fermi 
distribution, however, while the former possess 
the greatest probability. A graphical integration 
of (8), however, shows that the mean distance of 
the nearest electron with parallel spin to a given 
electron is 1.17, in contrast with the nearest 
distance of 2.37, for close-packed configurations. 
For this reason it seems reasonable to us, as far 
as energy is concerned, to make use of a ‘‘mean 
configuration” of the electrons with downward 
spin in which pairs of electrons form a body- 
centered lattice. In this the nearest electron will 
be much closer than it should be but there are 


19 Tt cannot be taken into account without also replacing 
the effect of electrons with parallel spin on each other by 
point charges in a similar way to that carried through here 
for the electrons with antiparallel spin. If this is not done, 
the two kinds of electrons will push each other into different 
parts of the crystal, thus invalidating the assumption 
under which we have calculated their mean effect. 
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no clusters of more than two electrons present, 
which compensates the fact that the pairs are 
at the most disadvantageous position. 

The energy e,(yı' * -Ya) of this mean configura- 
tion can be calculated very simply by the 
method employed in reference 1; the radius of 
the sphere surrounding every electron is (4)!r, 
and the potential energy within the sphere is 


V=2e&/r+er/r?—3.6e/r, (32) 


in which the last term has been introduced in 
order to satisfy (25). We shall designate x,(yı, 
-**, Yn, X) by x(x, y, z) when the configuration 
of the yı, ---, Yn introduced here is implied. 

In order to determine the xo associated with 
the lowest energy, we shall regard V in (31) as 
a perturbation, for which the unperturbed prob- 
lem is 


— (?/m)AE(x, y, 2) = E(x, 9,2). (33) 


Each of the solutions of this possess an ‘‘azi- 
muthal quantum number” J], these for }=0 and 
1=1 being of the form ri. = Smw,r, and rt, 
=0/dx sin wr, etc., where the « are to be 
determined from the relation tan „ur, = @, 71 arising 
from the boundary conditions. The normalized 
solutions for /=0, are, to a sufficient degree of 
accuracy 


| 1.024 sin 1.43rr/rı 
Eoo = (3/Arrı?)!; Ea = _—__ 


(2rrı)? r 


er an (2«+1)rr/2rı 
«= GEG Y nee re 


sin wır 


2rrı)! 


for which the corresponding unperturbed energies 


are 
(34a) 


The matrix elements Vo, of V may be obtained 
as follows. We have 


noo = 0, No = hw,2/m. 


Vo= | tooV End V 
= - B/4ars) M1 /or) f Vod V, (35) 


in which the Laplacian may be made to operate 
on the V by use of partial integration. From 
Poisson’s equation AV is —8re&(ö(r)—irrı?) in 
which the last term gives rise to a vanishing 
contribution since it, like &, is constant and the 


384 Part II. Solid State Physics 


CONSTITUTION OF METALLIC SODIUM 


integral over it vanishes in (35). The first term 
yields 


Vox = (24) $e? / wr? (36) 
so that the solution of (31) for »=0 is 
xo = (3/4747) 4 — 50’ eg, (37) 
x Ba?2/mt Vex 
and the corresponding energy is?” 
1/(16.2+3.07r,) +0.005. (38) 


Although this is given in atomic units, the 
correction is true for only half of the electrons 
and must be taken in Rydberg units. The 
solution (37) has also been obtained by use of 
numerical integration and is illustrated in Fig. 5. 


© h 2 3 4 5 6 
r (Bohr units) 


Fic. 5. Wave function corresponding to a solution of (30) 
with »=0 and r,=4. 


The wave function of an electron has only about 
30 percent of its mean value in the region where 
two other electrons are situated and only 50 
percent in the region where there is an electron 
with antiparallel spin. 

For v other than 0, we shall have greater 
energy gains because the Fermi distribution will 
be narrower as a result of the presence of the 
field arising from the y,---¥y, electrons than it 
would be without this field. In this case we can 
take the second term of (35) into account as a 
perturbation, the calculation being very similar 
to that employed in Section 4. The first approxi- 
mation is zero, while the second, namely 


(4n2h$/m?L2) v2 >- | S (OVo0/ dx) x«|?/—-Ex (39) 


is evidently negative, and would vanish if Xoo 
were entirely flat. Because of the selection rules, 


20 The Viz must be calculated graphically. We obtained 
Yı=24e/r,. 
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it is sufficient to take wave functions with /=1 
for x,, which are simply derivatives of the func- 
tions (34) to a first approximation, while xo 
must be taken from (37). The sum (39) may be 
conveniently compared with the Fermi energy 
of free electrons and a simple calculation shows 
that it is 4 percent of this quantity. 

If this result and (38) are subtracted from the 
upper curve in Fig. 4, one obtains Fig. 6 which 


Energy (Rydberg units) 
oO 


3 4 5 
rs (Bohr units) 


Fic. 6. The upper curve replaces that of Fig. 4 and 
contains the results arising from the correction to the one- 
electron picture. 


yields r,=4.40 or a lattice constant of 4.75A, 
and a heat of sublimation of 23.2 kg. cal. The 
experimental values are 4.23A and 26.9 kg. cal., 
respectively. For the observed lattice constant 
the calculated energy is 18.6 cal. It may be men- 
tioned at this point that the preceding calculation 
of the correlation energy is valid only in the 
neighborhood of r,=4. 

It is to be noted that the actual x, will not 
have, in general, forms such as those we have 
calculated here. For irregular configurations of 
the electrons with downward spin they will be 
rather complicated linear combinations of these 
functions, possessing energies that are nearly 
the same, however. Each function y, going with 
an energy é, will have a maximum in that region 
in which the potential has a trough allowing 
such a characteristic value and will be small 
everywhere else as a result of, interference of 
the superposed waves.” This does not affect 
the actual wave function (22), for all of the 
electrons, however, since this is a determinant 
of the ¥, and does not change upon taking such 
a linear combination. 


21Cf, the discussion of a very similar problem by F. 
Hund, Zeits, f. Physik 40, 742 (1927). 
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10. 

It is hardly necessary to mention that the 
calculation of the last section must be regarded 
only as an attempt to find the correct wave 
function for the electrons in the metal, and we 
are well aware that we could guess its form only 
roughly. We should like to mention that the 
deviation of the wave function from the determi- 
nantal form (21) using plane de Broglie waves is 
much more pronounced here than in the one 
electron picture (cf. Fig. 5) in which the plane 
waves were a surprisingly good representation. 
They still form a reasonably good approximation 
to the true state of affairs, however, and for 
this reason we believe that the fact that (22) is 
not symmetric under interchange of x and y is 
not of extremely great importance. The function 
(22) still possesses this property if we consider 
the differences of the y, from the de Broglie 
waves as small quantities and neglect the second 
order terms. 

The remaining discrepancy of 3.7 kg. cal. we 
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believe to arise from two sources. First it is not 
evident that the Prokofjew field is completely 
suitable for calculations of this kind and, in fact, 
it is surprising that the atomic levels are as 
good as we have found them to be. Rigorously 
the exchange terms between the valence electrons 
and the closed shells cannot be transformed into 
the form of potentials because they are repre- 
sented by more general operators, and there is 
a possibility that the effect of these is not 
described accurately enough for the states oc- 
curring in the crystal by an effective potential 
field. This important point is being more accu- 
rately investigated in connection with a treat- 
ment of Li to appear shortly and it is hoped that 
more light may be thrown upon it. 

On the other hand, it is also possible that the 
actual wave function is not represented to a 
sufficient degree of accuracy by a wave function 
of the form (2). It does not seem to be easy to 
use wave functions of a greater generality, 
however. 


APPENDIX Í 


It is possible to continue the wave function with the same spherically symmetric potential over 
the whole volume of the s-polyhedron. If one assumes this % to be valid in all the s-polyhedrons of 
the lattice, the resulting wave function will be continuous everywhere, but the derivatives will be 
discontinuous at the boundary planes. The energy, therefore, is not the characteristic value of the 
differential equation, but must be calculated as the sum of kinetic and potential energy. The cor- 
rection for the potential energy will be given in Appendix 2, while the correction for the kinetic 


energy is 


72m) f (grad Y)’dd— | (grad vo), (1) 


where Vj is the region of the s-sphere outside of the s-polyhedron and V; is the equally large volume 
of the s-polyhedron outside of the s-sphere. In the neighborhood of the s-sphere, we can set 


he he dp(r,) 
— grad y=—— (r— rs) = (V(r) ~E) g(r) (r — ra), (2) 
2m 2m dr 
so that (1) becomes 
(2m /h®) (V(r) Ele] f e-rya- f e- rd), (3) 
Vi Va 


Now the mean value of r—r, is certainly smaller than 70/20, Vi is smaller than »/10, and ¥(r.)2v0~1, 
so that the first part of (3) is 


(2mr 2 /h*)(V(r,) — E)?/4000 = 0.0004 Rydberg units. 


In addition to this, the second term of (3) essentially cancels the first, so that the total correction 
really is negligible. 
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APPENDIX 2 


It is necessary to calculate the potential energy of a body-centered lattice of positive point charges 
in a fluid of negative charge of density p =y? and then to compare this energy with the electrostatic 
energy of the s-sphere. 

To begin with, we can draw s’-spheres around each positive charge which are smaller than the 
s-spheres and just touch one another. Outside of these s’-spheres we may assume the density to be 
uniform and of magnitude p. Relative to the regions of space outside of the lattice, the energy of the 
s’-spheres is as large as if the negative charge had a constant density within the sphere. This is 
because the potential outside a sphere arising from a spherically symmetric charge distribution is 

-independent of the radial distribution inside of the sphere. 

The total electrostatic energy of the lattice may now be divided into two parts: first, the energy 
E, of a body-centered lattice of positive charges v,9=e with a uniform negative charge-density and 
second the difference, Es, of the inner energy of an s’-sphere with the actual s’-sphere on the one hand 
and an s’-sphere with uniform charge distribution on the other. Since the density is actually p outside 
of the s’-sphere, the second energy is equal to the difference in energy of the actual s-sphere and one 
with constant negative distribution p. This second part is 


Ey = E,— (35/8 7118/5)(pvg/d) = E, — 3.6557 pvo?/2d, 


where E, is the energy of the actual s-sphere, that is, the energy employed in the text, and 
— 3.6557 pv)? /2d is the energy of an s-sphere with uniform charge distribution. 

If Eı were —3.6557p*v?/4d our calculation would be exactly correct. Since it is actually 
—3.6391p2v?/Ad, as will be shown at once, the total energy is smaller than that used in the text 
by the amount 

0.0166p%v.?/2d = 0.004e?/r., 


which for r,=4 is only 0.62 kg. cal. | 
The calculation of E, is a problem of electrostatics, we have used essentially the procedure of 
Appell-Madelung. Since the proper energy of the positive point charge, which must be obtained, is 
infinite, we must assume positive charges of finite size concentrated into small cubes with edges 26. 
The Fourier expansion of the density is, then, 


œ 


2  an,exp [2rilxx+Ay+ uz)/d] 


x, À, pe—ao 


with 


a sin ax sin ad sin ap/d’adkXu, 
RUN 


where a=278/d and the first line is valid for «+A-+u even and the second for this odd in addition 
to the case x=A=p=0. 
The total energy per atom of the lattice is 


Eu = yd | On | 2/Am(E ++ y?), 
The density distribution of the positively charged cube may be written in the form of the Fourier 
integral 


ff» exp [2ri(xcx+Ay+ u2)/d Jdxdidu 


with 
a(kAy) = pu, sin ax sin aA sin au/d’adkiu 
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and the corresponding energy is 
En = emf ff faCxrp) | / ++ p deddy. 


The energy E, is now simply Eu — En. If we put ö=0 both En and Ey, diverge, but if we take the 
difference before passing to the limit and then set 6=0, we obtain 


N dë aal? (N+) |g KAu)|? 
E= limlim > lal? -SSS Jam, HT daddy 
{ 


Nasco $90 x, A, WEN 4r CANH NEN Hg 


PEN 1 td dadddu 
-im ig i af f ph edhe 
No md IN KEN (N=) PLM pF 
In a, the values for which «+A-+u is odd and the case x=’=p=0 are to be omitted. 
The summation and integration over u were carried out directly and yield 


E “=| = ( 1 h “(ett > 1 „7 ea , ys drd N 
, 2d \«trodd =x) E ath even =) ore K ) CESOL 


which was then calculated directly. The tgh and ctgh may be replaced by 1 when either x or X 
is greater than 3. For x or X greater than 9, the bracketed difference is 1/24(x+%?)! with sufficient 
accuracy, and the integration over x and X was carried out in this region. The terms for which «3, 
433 yielded — 3.57221, while the second sum gave 


Eı= —3.6391 Pu / 20. 


Kal 


The van der Waals attraction of the ions is — 0.12 cal. per mol. 


Note added in proof. In Fig. 3 the upper point of the (110) discontinuity, given by a pure p function, 
lies at —0.33 Rydberg units in place of the value shown. The lower is at —0.34. Although the first 
lies below the parabola, this is undoubtedly of no real significance. 
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The energy of interaction between free electrons in an 
electron gas is considered. The interaction energy of 
electrons with parallel spin is known to be that of the 
space charges plus the exchange integrals, and these terms 
modify the shape of the wave functions but slightly, The 
interaction of the electrons with antiparallel spin, contains, 
in addition to the interaction of uniformly distributed 
space charges, another term. This term is due to the 


l. 


HE attempt has been made in previous 

= work! to give a more general expression 
for the wave function of free electrons in metals 
than that provided by Hartree’s method of the 
self-consistent field? 3 or Fock’s equations. The 
form of the wave function assumed in Fock’s 
equations for a system of 2» electrons, occupying 
rn doubly-degenerate states is 


i Yılaı) Yılza) Yılyı) Wily) 
=| > . ff. . fa) 
¥n(X)) vfs ¥nlXn) Ways) ‘ee Waly) 


where x stands for three Cartesian coordinates 
of electrons with upward spin, and y for those 
of electrons with downward spin. The y, are 
the solutions of a Schrödinger equation in which 
the potential of the charge distribution of the 
other electrons enters as well as the potential 
arising from the ions. 

In a metal the charge distribution of all 
electrons is practically unaltered by removing 
one so that the second quantity may be replaced 
by the former and the potential for a given 


fact that the electrons repeil each other and try to keep 
as far apart as possible. The total energy of the system 
will be decreased through the corresponding modification 
of the wave function. In the present paper it is attempted 
to calculate this “correlation energy” by an approximation 
method which is, essentially, a development of the energy 
by means of the Rayleigh-Schrédinger perturbation theory 
in a power series of e*. 


electron at the point # is given by adding to the 
Coulomb field of the ions the fields of all electrons 
with parallel and with antiparallel spin. The 
former distribution may be obtained by inserting 
u for x, in (1) and integrating over all coordinates 
except x, and u, while the latter is obtained by 
a similar operation with the exception that the 
integration should be carried out over all 
coordinates except yı and u. 

Actually, it had been shown in! * that the 
wave functions %, of the free electrons in a 
Na-lattice are very nearly plane waves e***-#% 
where L is the cube edge of the crystal and y 
stands for a set of three integers, »-x denotes 
the scalar product of v and x. Hence the charge 
distribution of the electrons with opposite spin 
is practically uniform, that of the electrons with 
parallel spin uniform with a “hole” around «.® 

In no wave function of the type (1) is there a 
statistical correlation between the positions of 
electrons with antiparallel spin. The purpose of 
the aforementioned generalization of (1) is to 
allow for such correlations. This will lead to an 
improvement of the wave function and, therefore, 
to a lowering of the energy value. This energy 
gain will be called “correlation energy.” 


The new form of the wave function, assumed in! was 


Yılyı“ Yni X1) 


n! . 

Yalyı' Yni X1) 
1 E, Wigner and F. Seitz, Phys. Rev. 46, 509 (1934). 
D, R. Hartree, Proc, Camb. Phil. Soc. 24, 89 (1928). 


J. Slater, Phys. Rev. 35, 210, 1930; V., Fock, 
Zeits. E Physik 61, 126 (1930). 


Yılyır Yni Xn) 


Vn ' . 


vn) Palys) 


(2) 


‘Yni Xn) aly) Palya) 


4 J. C. Slater, Phys. Rev. 45, 794 (1934); A. Sommerfeld 
and H. Bethe, Geiger-Scheel’s Handbuch der Physik, 
Vol. 24, 2nd part, 2nd edition, p. 406. 

SE, Wigner and F, Seitz, Phys, Rev. 43, 804 (1933). 
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which contains the functions y,(y1- * -yn; x) which are different functions of x for different configura- 
tions yı, ***, Yn of the electrons with opposite spin, instead of the ¥,(x). It is proposed to find 
the best wave function, i.e., that with the lowest total energy of the form® (2). The yı, ---, Yn in 
Y,(yı "Yan; x) are to be viewed merely as parameters. (Cf. Eq. (23) ref. 1.) The total energy of the 
wave function (2) was previously calculated and relative to the solution of Fock’s equations yielded 


the following energy 
1 pyx) |? 
B=— faylyl*{ on -E E| feo: et aa] |, (3) 
n: Yk 


2m vv’ 


where |y] denotes the second determinant of (1), the summation « runs over all 3” coordinates y 
and that over v and v’ over all occupied states, i.e., over all indices occurring in the wave functions 
in (1) or (2); ¥,(y; x) stands for Y,(yı- + -vn3x) and dy for dy,:++dy,. The quantities e, are the integrals 


e,(yı‘ y= f 0% x1)*{ V—(h?/2m) (Az, + Ay, ++ ++ An) voy; xida 
+ f pa(x1)*(h?/2m) Any, (x1)dxn (3a) 


where again V(yı- - -yn; x) is the difference between the potentials at the point x of a charge distribu- 


tion corresponding to Yı(ly), Yely), ---, Ya(y), on the one hand, and point charges at yı, ***, Va, 
on the other. Ä 
It is necessary now to assume for y,(y1, ***, Yn; x) the form- 
Wy; x)= W(x) LH) +hly2—x) ++ +h(yn-x)} (4) 


and that if » and v’ are both occupied states 
[v0 20*av.0:2/8y9d2,=0 (4a) 


so that the second term in (3) vanishes. Both (4) and (4a) will turn out to be correct in the approxi- 
mation to be used. By means of (4) it is possible to transform (3a) so as to get rid of all derivatives 
with respect to the y, after which one may minimize (3) by minimizing e,(Jı, -++, Yn) for every 
combination of the y. This would lead to a differential equation for the ¥,(y1°++y,; x) in which the 
y would be merely parameters. (The solutions ¥,(x) of Fock’s equations which also enter into this 
equation are supposed to be known.) The result of the transformation i is especially simple if one uses 
for y,(x) = e?rir=!Z, namely, 


(Yi Yn) = fro: x)*{V— (h”/m)(A=z— (2riv/L) grad.) |v,(ly; x)dx. (3b) 


In addition to the energy contribution (3) which is negative and was calculated in! there is a 
further one which is generally positive. This arises from the fact that the probabilities of the relative 
distances of electrons with upward spin are changed by the transition from the y,(x) to the 
W(y1, ***, Yn; X). Since the latter will be large for x’s, which lie in regions comparatively free from 
y's, the distribution of the x’s will not be uniform throughout space and they will be nearer together 
than they were under the previous assumption. 


€ The form (2) of the wave function is certainly not the yus(yı- "ya; x) are not too different from the functions 
correct one. It does not belong even to one single multi- y,(x), they all belong to very low multiplicities. This is the 
plicity but is a linear combination of functions of different Aor case, anyway, in which the present approximation is 

“multiplicities” (belonging to different representations of good and it can be expected that the real wave function 
of the symmetric group). If, however, the functions is in this case near to (2). 
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In order to evaluate this energy change, one may first calculate the probability of two electrons 
with parallel spin being at the points x, and x, respectively, if the complete wave function is given 


by (1). Under the approximate assumption that y,(x) =yo(x)e?"*"=/? one obtains for this in the way 
given in reference 5, 


| Yo(%1) Yo(%e) ’gllxı-xe]), (5) 
where Ma) eii 
sin (r/d) — (r/d) cos (r 2 
g(r)=1 -9( =) (Sa) 
(r/d)? 
is the probability of the distance |x, —xz| =r for free electrons with parallel spin, 
d= (w/3r)t= (4/9 m)}r,=0.521r, (6) 


is 1/(2r) times the wave-length of the fastest electron, vo is the atomic volume and r, the radius of the 
sphere with this volume. If we make the assumption y,(y, x) =y,(y, x)e?"*@-”)-2/4, an expression of 
the form (5) is valid for the wave function (2) as well as for (1) and the change of mutual potential 


energy of the electrons with upward spin arising from the transition from (1) to (2) is 


1 
Ex=— fiyl E m), 
n: ó 


where 


Es (Y1 Yn) = Í dxidxa(ne?/2r){ | vy; x1) Why; x2) |?— | Ya (x:)Y(x2)|?} g(r). 


(7) 


(7a) 


The 4 enters once again because the interaction of a pair of electrons should be counted once only. 
In order to be able to evaluate the integral (7a), g(r) has been replaced by 


1 —e-!r/4(1 4-1.6r/d+1.2(r/d)?), 


(8) 


which, as is shown in Fig. 1, runs rather near to g(r). 


3. 


The task of Section 4 will be to calculate the 
wave functions ¥,(y1°+*yn; x) which minimize 
the sum of expressions (3b) and (7a) and to 
calculate Eı+E2 corresponding to these wave 
functions. Before doing this, however, an esti- 
mate of the order of magnitude of the effect to 
be expected should be given. This can be taken 
from calculations of atomic spectra by the 
method of Fock’s equation or Hartree’s field, 
and their comparison with experimental results. 
The best result in this connection seems to be 
that on the normal state of He, where Fock’s 
equation is identical with Hartree’s. The dis- 
crepancy here is? 0.077Ry (Rydberg units) for 
both electrons, or 12 Cal. per electron. This 
must be the amount of correlation energy in He. 


7Cf. D. R. Hartree and A. L. Ingman, Mem. of the 
Manchester Lit. and Phil. Soc. 77, 69, 87 (1933). 


The situation is somewhat more complicated 
in the calculation of the terms of O**, O+, and 
O by Hartree and Black,’ since Hartree’s 
method has been used instead of Fock’s, and 
also because in the latter cases more than two 
electrons play important rôles. For O*+ the 
differences between observed values (in brackets) 
and theory are 


3P(4.050)0.074; 1D(3.868)0.090; 


15(3.658)0.176. 


If we denote the radial wave function for the 
electrons by P(r), these terms correspond to the 
linear combinations P(r)P(r’) multiplied by 
(x-biy)(x’ iy’); xx tyy +22. 
In the first case the correlation energy is very 
small, since the electrons are probably far away 


xy — yx’; 


$! D. R. Hartree and M. M. Black, Proc. Roy. Soc. 
A139, 311 (1933). 
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from each other anyway, because their spins are 
parallel. In such cases in the theory of metals 
we do not take into account any correlation 
energy at all. The discrepancy in this case may, 
to a considerable extent, be due to the use of 
Hartree’s instead of Fock’s method, and cer- 
tainly would be further diminished by the use 
of the latter. On the other hand, in the case of 
singlet’s, Hartree’s equations correspond much 
more closely to Fock’s equation and the increased 
discrepancy of about 0-050Ry is probably due 
to the neglect of the correlation energy. It is 
not quite clear, however, why it is so much 
greater in the 1S than in the !D term. 

A comparison of experimental and theoretical 
values in Ot and O points in a similar direction, 
the correlation energy is smaller though in these 
cases by a factor of the order 2. It is evident 
that it must diminish eventually if one goes 
over to more and more loosely bound electrons, 
since because of the lower electron densities the 
total interaction energy diminishes and the 
correlation energy is only the non-appearance 
of part of this. 

If one goes over to a metal like Na, on first 
sight the effect could be expected (because of 
the low electron density) to be much smaller 
than in He, about as great as in O. This will 
not be quite so, however, because the fluctuations 
in the potential of the electrons with downward 
spin will be greatly increased by their great 
number. The effect to be calculated in the next 
section will be about equal, therefore, to the 
effect per electron in He. 
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4. 


Although the actual wave functions y,(x) in 
the Na lattice are actually different from plane 
waves ¢?rv-2!4 in the Hartree-Fock approxima- 
tion, we shall use plane waves for y,(x) in the 
subsequent calculation. Since only integrals over 
the unperturbed functions occur in the perturba- 
tion ‘calculation, this will not introduce a great 
error, because the y,(x) are extremely near to 
plane waves in much the greatest part of the 
volume. 

The whole following calculation will be per- 
formed in the approximation which corresponds 
to the second approximation in the Rayleigh- 
Schrédinger perturbation theory. The wave 
functions will be developed into a Fourier series 


Wily; x)= Lert Y apeti), (9) 
u 


eriv-zIE being taken as unperturbed function and 
Vivi, ***, Yn; x) the perturbation. The «,,, 
which are functions of the y, are supposed to be 
small, so that third order terms of a@ and V will 
be neglected for the energy, and second order 
terms for the wave function. 

For the actual calculation it can Be seen, first 
of all, that one can replace a set of ¥,(y; x) by 
any orthogonal linear combination of them with- 
out affecting the final result. The orthogonality 
condition between y,(y; x) and Y» (y; x) gives 


yy tat + DL aarp = 0. (10) 
H 


By Schmidt’s method one can build a set of 
orthogonal y,(y; x) such that æ,» =0 for v<v’ 


Fıc. 2. 
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if y’ is occupied. It then follows under omission 
of the last term in (10) that a,,-=0 if v and »’ 
are both occupied states, whence the summation 
over a in (9) must be extended only over the 
unoccupied states. It is then seen at once that 
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not be zero for all values of the y, but they 
evidently will not change the wave functions 
and merely give a contribution to the energy. 
The average value over all configurations of this 
contribution is zero, however, since the mean 


value of the potential of different configurations 
of charges is equal to the potential of the mean 


(4a) is satisfied in our approximation. 
In order to calculate e,(yı- - -y„) one must first 


calculate the matrix elements of V(yı, ---, 23x). charges. Therefore, we shall set V, (yi, ** *, Yni) 
The matrix elements V,,(y1, +++, Yai x)= Vo will =0. For uzv we have 
Vu Yni x)= LS f erial E V(y; x)dx 
= —(1/4r?(u— er) [erkenne V(y; x)dx (11) 
= (e?/r(u PR y)?L) > ettiyi 
«al 
because of Poisson’s equation. 
For e,, (3b) yields 
th? 
e,(yı° Yn) = 2 (aru Vost e Ka FQ ay! yy! Vaw +> (u? u‘ v) | ayy), (12) 


» mL? 


In order to calculate the e,’(y1: + +n) by means of (7a), the charge distribution for Y,(y; x) will first 


be found: 


| 1,0; x) | 2 L3+ LI (apert L oy, pte? Oe a L) HLL cy, * ary pre? TiL, (13) 
# gu! 

Now g(r) is 1 minus the function of the “hole,” which decreases rapidly with increasing r. The 1, 

inserted into (7a), simply gives the energy difference of a uniform charge distribution and that 

corresponding to Y,(y; x), namely, 


| ary |? 


(14a) 


Avy Avev—p 


tener] Y h, 
adalu y 2/2 
2 „dr (u—v)?/L 


n Ar2(u—v)2/L2 


in which the terms higher than the second order in a are omitted, and R means that the real part 
of the following expression is to be taken. The second sum must be taken only over those y’s for which 
2v — p is unoccupied, while y itself is an unoccupied state in all summations. For the calculation of the 
other part of (7a), arising from the function of the hole, one sets xı+r for x2 in (7a), introduces (13), 
and performs the integration over x;. The result 

2L-*R (2 | yy | + E a0, l O he 


Z—p 
unoce. 


must be multiplied with ne?/2r and the function of the hole g(r) — 1, which must be taken from (8), 
and integrated over r. Setting o =2r(n—v)d/L, this Ie 


3.75 


1.063 
Yo) (14b) 
(1+02/2.56)° 


1+02/2.56 Paaren. 56)? 


Ard’ne? 


2,5623 


R(Zlanlt+ > ana): (—— 


2y—p 
unoce, 


Added to (14a) this gives with help of the relation irai 
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2% e 
er (Yır Yn) = = R(Elanlt+ 2 ann) (o), 
te Ea 


(14) 
1 0.415 1.465 


o2(1+02/2.56) (1+0?/2.56)? (1+02/2.56)3 


ae’ (c) is given graphically in Fig. 2. 

This formula for the increase of potential energy between electrons with upward spin is not 
exact and may be viewed as containing two parts: First (14a), the increase of the potential of the 
space charges, due to the less even charge distribution for ¥,(y; x) than for y,(x). This increase is 
lowered by the second part (14b), caused by the greater efficiency of the Fermi hole in a non-uniform 
charge distribution. The first neglection was made in setting Y,(y; x) =e =i IY (y; x) when 
calculating e,. This tends to increase the e, especially for those v, for which it is large anyway, 
because it overemphasizes the unevenness in the charge distribution. The second neglection was to 
keep no terms higher than the second order terms in a. This certainly decreases e,’, because part 
of the uneveness in the charge density is due to the higher terms, especially to those of the fourth 
order. Finally, the normalization constant, which is smaller than 1, enters in the second power in 
(14) and only in the first in (12). Its omission again increases (14). In the whole these errors will 
about compensate. 

The final quantity to be minimized is, after omission of the higher order terms, 


€y+ e =R} {2a Vout (trut try) | Ayu | 2 Firu Cup Ovary} ’ (15) 
p 
where 
4r?h? ; , 2? e r 2r(u—v)d i 
„= TON ty’=—--—e’ ———). 
en, ea) (15a) 


V,, is given in (11), e’(o) in (14), v is an occupied state, the last term in (15) should be taken only 
if 2v— p is an unoccupied state. By setting the derivative of (15) with respect to a,,* equal to zero, 
one obtains 


Var + (top tt), 0 (16a) 
if 2y—y is occupied. If it is unoccupied 
Vou* + (trut tvg) Qyy Firs Oye p= 0, Voy + (ae +t,,’) rau Firg = 0. (16b) 


The last equation is obtained by differentiating (15) with respect to a,.,_, and considering that 
V,-„= V,,* and t,2,-,’ = trp. Solving (16), one finds if 2»—yp is occupied 


Ara — Vu tt) (17a) 
Avy — Vrp lror—u/L (irut Erp va t lepton | (17b) 


if 2»—p is unoccupied, while „ is, of course, always unoccupied. These formulas show that the 
Yılyı +*+, Yn; x) do have the form (4) with 


and 


e? er rio-n)-(v-r)/D e? 7i u) (y~z) L 
otga (z o es = m), (18) 
wh \2v-u (u— v)? (trut trp) ne. (u— v)? CEEE [tyov—n) 
fase (17) into (15) one obtains for the total energy 
| V, > | V, | Eror 
e,+ €, et 2 a = 2 ee e eee (19) 


eg tatt ae (iyut trp )bvar—pt boty” 
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Instead of the second term one could write half of the sum of the terms for u and 2v— p, which makes 
it somewhat more symmetric. 


5. 
It would be rather difficult to perform the summation over u in (19) for an arbitrary set of y, 
-+, a. Fortunately only the mean value of (19) with the weight |y|?/r! is needed and this can be 
computed quite easily. Since the ¢ do not depend on the y, one finds 


4 n 


1 
n! fi ee 2 foeon WL) y | erint tran Edy 
t. W\ uw x, k=l 
(20) 


et n 
(= 1—% i-utnn)). 


m(p— vt? \ cmt 0 ner 


The first term comes from x =À, the second from «+. In the second, the summation over X can be 
carried out, and it yields 1 if v—u+ v, is an occupied state, zero otherwise. The whole bracket is, 
therefore, equal to the number of occupied states v’, for which y+ »’ —y is not occupied. Fig. 3 shows 
a cross section of the v’-space through the origin and the point v—u. The radius of the circles is 
(3n/4r)t=_L/2rd. The sphere through the weak circle contains the points »~y-+»’ and the hatched 
part is unoccupied. As a consequence, it is 


- fiv aly dy=—— rla), alo) ! fortel>2 oy) 
— 7 =—n (ru r = 
n! _ rlan) . ” me 3le| Jel? 
_——— for |o} <2 
4 16 
and hence 
1 2Ry n(o)de 2Ry n(o)de 
-ifero er ee 
n! 3r? oLottoa-p+ce'(s)} 3x3 | of — (a+ p)?+ 2ca%e'(a) ] 
let p> le+pi>1 (22) 
le- »Isı Io-pi>1 


Here p=2arvd/L, e=2#(u—v)d/L and the summation over x has been replaced by an integration 
over o. The total energy is expressed in Rydberg units. The constant c is 


c= (24/3tat) (e2md?/h?r,) = 0.11067,/do (22a) 


when expressed as function of the radius of the "'s-sphere”’ in Bohr units dp. 


Y 


N 


Fic, 3, Cross section of »’ space. Fic. 4. Section of g space. 
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0% 


o l-p (ph l+p 2 


T, 


Fic. 5. Note: (1—p?)? should read (1 p9}. Fic. 6. 


In order to perform the integration of (22), one may first introduce elliptic coordinates ı=|o|, 
o=|o+p|=|2rud/L]| in the o-space, using as centers the origin and the point —p (cf. Fig. 4). 
The first integral is to be extended over the horizontally, the second over the obliquely hatched 
region. The first will be a sum of two integrals (a) and (b) in Fig. 5, in both of which n(o) =30,/4 
—o,3/16. The second integral is a sum of three integrals (c), (d) and (e), and in the first two of 
them the same expression is valid for n(o) while n(o) =1 in the last. For de one writes 270;do\00d02/ p, 
of for o?, and e- p=4(o.?—o;?— p). The integration over o2 can be carried out simply and gives the 
five integrals 


ARy et Iee'(os) +202+2erp 36, aPN doy 
Da ae n Melle Can 
4Ry I+ 2ce (a) +20: +2019 30, o1°\do, 
ER er 2ce' (01) -+30;2+p?—1 aan 
4Ry pto 2ow+o;?+p?-1 30, o13\ do, 
E E Pe rer ade 
ie eae 5 
Sr?pY up v—p 4 16/o,v Srp we u—p 0, 


where v stands for (2ce (c1) +0:2)?. A calculation of this quantity for r, =4 shows that it is practically 
equal oı if ¢;>2 and the last integral can be evaluated accordingly. It yields 


4Ryjsi 1 2+p 1 a 
3r2pl\8 272, 2—p 2 Hm: 20 


The other integrals were evaluated numerically and the results are plotted against p. Fig. 6 shows 
the plots for r,=1, 2,4, 8. From this, the mean value of the correlation energy which is the mean value 
of —#F, with the weight p, was calculated, and plotted against r.. The 3 enters, because the whole 
energy correction is present only for half of the electrons, that is, those with upward spin. In Fig. 7 
the upper curve represents the values calculated in this way. The energy is given in multiples of e?/r.. 
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6. 

One must remember, of course, that the 
preceding calculation is only an approximate 
one. Even if one confines oneself to wave 
functions of the form (2), the upper curve of 
Fig. 7 gives the correlation energy in first 
approximation only. The neglections are due to 
three causes: first to the use of an unnormalized 
wave function, second to the neglection of the 
terms with higher than the second power of a 
in e,, when going over from (12) to (14a) and 
third to the non-complete orthogonality of the 
wave functions employed. Our approximation 
is good if the a’s are small. One gets an idea 
about’ the accuracy of the approximation by 
calculating PpS |o,.|?]y|’@y/n! (which is the 
reciprocal square of the normalization constant 
minus one), though an idea only, since I ,.|,,|? 


should really be small for all configurations of 


the y, not only its mean value. A calculation of 
the former quantity shows? that it stays well 
below one, except for large r, and for p which 
are very near to 1. It is in these cases that our 
approximation must be expected to break down. 

The real value of the correlation energy will 
be smaller in these cases than the calculated one. 
The correction with the normalization constant 
could easily be taken into account, as has been 
shown at another place.’® It always decreases 
the correlation energy, not very much, however, 
as the magnitude of the normalization constant 
or the formulas in!’ show. The second neglection 
is probably more dangerous and also more 
laborious to correct. It has been done for one 
point (r,=4) only and for this one very roughly. 
The second neglection also increased the calcu- 
lated value of the correlation energy, since it 
amounts to taking 1—2f instead of (1—f)? for 
the probability of the electron being at a certain 
point, and the minima of 1-2f are much 
deeper than those of (1—2f)? (the maxima are 
lower but less important). 


° The greatest part of the numerical work has been done 
by Dr. M. Vermes of Budapest. A table of the calculated 
values is given here: 


p r= 4 
0 0-006 0-10 0-45 
0-4 0-009 0-14 0-54 
1 0-04 0-30 0-94 


10 To appear shortly in the Bull. of the Hung. Acad. 
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If the electrons had no kinetic energy, they 
would settle in configurations which correspond 
to the absolute minima of the potential energy. 
These are closed-packed lattice configurations, 
with energies very near to that of the body- 
centered lattice. Here, every electron is very 
nearly surrounded with a spherical hole of 
radius 7, and the potential energy is smaller 
than in the random configuration by the amount 
0.75 =e?/r,. This would be the sum of the correla- 
tion energy and that due to the Fermi hole. Since 
the latter one is, 1! 0.458e?/r,, the maximum 
amount of the correlation energy is 0.292e?/r,. 
This value will be attained only if the kinetic 
energy can be neglected, i.e., for 7,= 0, and 
represents the asymptote to the real correlation 
energy curve, which is attempted to be drawn 
into Fig. 7. It appears to run much higher than 
one would have thought without calculation. 
I believe it to be in error everywhere by less than 
20 percent. 

The dotted line at 0.142e?/r, corresponds to 
correlation energy as great as assumed in the 
first calculation® giving 0.6e?/r, together with 
the energy of the Fermi hole. 

The calculated constants of the Na lattice with 
the correlation energy of Fig. 7 and the other 
quantities as in reference 1 are as follows: lattice 
constant 4.62A as compared with the observed 
value of 4.23A. The binding energy associated 
with this is 26.1 Calories, to be compared with 
the observed value of 26.9. The calculated value 
of the binding energy for the observed lattice 


Fic, 7. 


u F, Bloch, Zeits. f. Physik 57, 545 (1929). 
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constant is 22.3 Cal. As far as the lattice constant 
goes, one must remember, however, that both 
the correlation energy and that due to the Fermi 
hole are calculated for a flat wave function and 
the wave functions are flat only for „=4 and 
its neighborhood. 

The magnitude of the correlation energy is 
important for questions of paramagnetism and 
ferromagnetism as well as for questions of lattice 
energy. It modifies Bloch’s original theory on 
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the ferromagnetism of free electrons" in such a 
way that it yields ferromagnetism in fewer cases 
than in its original form. I hope to return to 
this question at another time. 

I wish to express my gratitude to Dr. F. Seitz 
for his kind help in connection with the prepara- 
tion of this manuscript. 


1? A paper of S. Schubin and S. Wonsowsky, Proc. Roy. 
Soc. A145, 159 (1934) which appeared recently, points in 
the same direction. 


Theory of the Work Functions 
of Monovalent Metals 


E. P. Wigner and J. Bardeen 


Physical Review 48, 84-87 (1935) 


(Received April 30, 1935) 


The factors which determine the work function of a 
metal are described in a qualitative way. The work function 
is defined as the difference in energy between a lattice with 
an equal number of ions and electrons, and the lattice with 
the same number of ions, but with one electron removed. 
The work function is then found by first calculating the 
energy of a lattice with ; ions and n, electrons. The final 
formula gives the work functions of monovalent metals in 
terms of the heats of sublimation. This formula is approxi- 
mate, and can claim validity only in a qualitative way, as 


(1) 


T is perhaps not quite superfluous to have, 

in addition to a more exact calculation of a 
physical quantity, an approximate treatment 
which merely shows how the quantity in question 
is determined, and the lines along which a more 
exact calculation could be carried out. Such a 
treatment often leads to a simple formula by 
means of which the magnitude of the quantity 
may be readily determined. A treatment of this 
nature has been given by O. K. Rice! for the 
binding energies of the alkalis. 

We intend to give here an approximate 
calculation, on somewhat different lines, of the 
work functions of monovalent metals, the main 
result being a formula which relates the work 
function with the heat of sublimation. We do 
this for two reasons; first, because a more exact 
treatment seems to involve a great deal of 
computational labor, and second, because the 
connection between the work function and other 
properties is empirically very pronounced. In 
fact, Sommerfeld,? in his original paper on the 
electron theory of metals, has already noticed 
that if one orders the metals according to their 
work functions (or according to their Volta 
potentials), the series is the same as if one orders 
them according to the Fermi energy of the 
electrons, calculated on the basis of the free 
electron theory. The greater the Fermi energy, 
the greater the work function. This fact was 


10. K, Rice, Phys. Rev. 44, 318 (1933). 
(1998) Sommerfeld, Naturwiss. 15, 825 (1927); 16, 374 
1 . 


one of the important factors, the electric double layer on 
the surface, is omitted entirely, and it is assumed that the — 
Fermi energy is as great as if the electrons were entirely 
free. The values obtained from this formula check very 
closely with the experimental values for the alkalıs, so that 
it can be concluded that the double layer is probably small 
for these metals. Finally, the deviations to be expected for 
other than monovalent metals are considered. A more 
exact calculation of the work function of one substance 
(Na) will be given by one of us in an ensuing paper. 


rather puzzling on the basis of the naive free 
electron theories of that time, but an explanation 
was given later by Frenkel.? Although Sommer- 
feld and Bethe‘ have shown that the details of 
his calculations cannot be maintained, we feel 
that his basic idea is correct, namely that the 
binding energy of the electrons (i.e., work 
function) increases with the kinetic energy. The 
connection between the kinetic and potential 
energy, as given by the virial theorem in a 
Coulomb field, has been utilized by Tamm and 
Blochinzev® on the basis of the Fermi-Thomas 
model. It seems to us, however, that it is danger- 
ous to use the Fermi-Thomas model in this 
connection, because this mode! does not yield 
the metallic bond.® 
(2) 

The work function may be defined as the 
difference in energy between a lattice with an 
equal number of ions and electrons, and the 
lattice with the same number of ions, but with 
one electron removed. It is assumed in both 
cases that the lowest electronic states are com- 
pletely filled, so that the electron is removed 
from the highest energy state of the neutral 


‘metal. It is necessary to specify the position of 


the electron after its removal from the lattice, 
because, in general, the work function is different 


3 J. Frenkel, Zeits. f. Physik 49, 31 (1928). 

ıSee article by A. Sommerfeld and H. Bethe in the 
Handbuch der Physik, Vol. 24, Berlin, 1933, p. 424. 

5 {372mm and D. Biochinzev, Zeits. f. Physik 77, 774 
(1932). 

t For a discussion of the Fermi-Thomas model as applied 
to the metallic state, see J. C. Slater and H. M. Krutter, 
Phys. Rev. 47, 559 (1935). 
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for different crystallographic planes. We shall 
suppose that the electron is at a point in the 
neighborhood of a surface plane of the crystal, 
the distance from this plane being small com- 
pared with the dimensions of the plane, but large 
in comparison with atomic dimensions. We shall 
calculate this energy difference by calculating 
the energy E(n,, ne) of a lattice with n; ions and 
Ne electrons where we may suppose that |n;— nel 
<n; so that effects connected with the finite 
capacity of the sample may be neglected. For 
the calculation of E(n,,n.) we shall use the 
method of orbital wave functions. This method 
requires hardly any modification of the scheme’ 
used for n;=n., i.e., for uncharged metals. 

There is only one important difference, that 
due to the electric double layer which the 
electron cloud and the ions may form on the 
surface of the metal. These double layers are 
due to the fact that the electron distribution 
will not be symmetric around the surface ions, 
as it is around the inner ones. It will extend 
partly outside of the limits of the spheres 
surrounding the ions* of the surface, and in the 
inside of the spheres it will have partly greater, 
partly smaller densities than in the inner s 
spheres. This alteration of the charge density 
may result in double layers on the surface of the 
metal. The double layers will be such that they 
generate a constant potential inside the metal. 
The potential outside the metal, however, will 
not be constant, but will vary in such a way that 
the potential differences between the outer 
neighborhoods of differently oriented crystal 
surfaces will be equal to the differences in the 
moments of the double layers of the correspond- 
ing surfaces. These differences are revealed 
experimentally as the differences in the work 
functions of the different surface planes of the 
same crystal. This shows, conversely, that the 
double layer is of the same order of magnitude 
as these differences, i.e., around 4 to 1 electron 
volt.? 

The total energy of the lattice with n; ions 
and x. electrons can be considered as containing 

1 E. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); 
46, 509 (1934) (I and II); E. Wigner, Phys. Rev. 46, 1002 
(1934) (III); F. Seitz, Phys. Rev. 47, 400 (1935) (IV). 

8 J. Frenkel, Zeits. f. Physik 51, 232 (1928). 

a teal reference 7, II. 


Farnsworth and B. A. Rose, Nat. Acad. Sci. 19, 
777 (1935); B. A. Rose, Phys. Rev. 44, 585 (1933). 


three parts: (1) the energy which the lattice 
would have if the surface ions were surrounded 
with the same symmetric charge distribution 
which prevails in the interior; (2) the energy 
change of the inner ions and electrons due to the 
double layer on the surface; (3) the energy of 
this double layer. We shall assume that the 
electron comes from the interior of the metal, 
i.e., that the double layer remains unchanged, 
and that the density of the electrons changes in 
the interior only. This is, strictly speaking, 
incorrect, as one can easily see that the density 
will be practically unchanged in the interior, and 
that the electron will come from the surface. 
The result for the work function cannot be 
incorrect in spite of this, because the energy 
required to move an electron from the surface 
to the interior of a metal is of a much smaller 
order of magnitude than the energy required to 
remove an electron from the metal (or, the work 
function). If the energy necessary to move an 
electron from the surface to the interior were 
appreciable, the electrons would have already 
rearranged themselves in the neutral metal. 
We shall omit in the final result the change in 
energy of the second and third part of E(n,, ne). 
This is not allowable, of course, but this part of 
the work function is just equal to the moment 
of the double layer, and cannot be taken into 
account without an explicit calculation of the 
latter, which is reserved for a later paper by 
one of us. 


(3) 


We can now use for the calculation of E(n;, ne) 
the scheme of reference 7. The kinetic energy of 
the ions will be neglected, so that the total 
energy contains three parts, 32e,V; for the 
ions, 42e;V; for the electrons, and the kinetic 
energy of the latter. The first part can be calcu- 
lated as follows: First one assumes that the 
density of electrons everywhere, except in the 
s sphere of the ion under consideration, is as 
great as it would be if ne=n;. Then, the charges 
outside this s sphere will have no effect on our 
ion, because the potential of the electrons in 
every other s sphere will be nullified by the 
potential of the ion inside.!° Only the double 


10We neglect the small correction of Appendix 2, 
reference 7, II. 
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layer of the surface will be effective, and we 
obtain as the first contribution to the energy of 
the crystal: 


ni (me/2n)f|¥|2Vdv+4eD], (1) 


where V is the potential of the ion core, D is the 
moment of the double layer, and the integration 
must be extended over the s sphere of the ion. 
The factor n; comes in, because we have n; ions, 
the factor n./n:; because there are only that 
many electrons in the mean in the s sphere; 
(1) should still be averaged over the wave 
functions of the different free electrons. The fact 
that there are ((m;—1)/n;)(n;—n,.) electrons 
missing outside the s sphere introduces a further 
correction : 

z2e,®,, (1a) 
where &; is the potential of the missing electrons 
at the jth ion, and is of the order of magnitude 
(n;—n.)e/R where R is a lateral dimension of 
the crystal. 

The potential energy of the electrons is more 
complicated. First, again, we can supplement 
the missing electrons outside the s sphere in 
which our electron happens to be. The potential 
energy due to the nearest ion is 3V(r). To this 
the electron interaction energy must be added, 
which, if the charge distribution is assumed to 
be uniform, amounts to (,/n,) (3e?/2r, — e’r?/2r,?) 
where 7, is the radius of the s sphere. These 
two terms give, together with the contribution 
from the double layer, if averaged over all 
positions of the electron under consideration in 
the s sphere: 


neL3S |W|?*Vdv+0.6e?n./n.r.—teD | (2) 


with definitions similar to those used in (1). 
The electrons missing outside the s sphere, yield 
the following correction to the energy: 


— (m_/2n;) De®}. (2a) 


Next we must take into account the effects of 
the different holes. The Fermi hole (exchange 
energy) gives —0.458e?/r, if n.=n;. One easily 
sees that r, must be replaced by (n;/n.J)tr. if 
n.»“n;. Similarly, if we denote by r,f(r.) the 
correlation function," the correlation energy for 


in Represented by the lower curve, Fig. 7, reference 7, 
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one electron is —ef((n;/n.)!r,). The total contri- 
bution from these sources is: 


— nel (0.458e7/75)(n</m:)'+e7f((ms/me)'rs) J. (3) 
Finally, the kinetic energy of the electrons gives: 
— (h?n./2m) fy*Aydy, (4) 


which again should be averaged over the different 
free electron wave functions. 

The energy of the double layer due to the 
electrons and ions inside the crystal is simply: 


—teD(n.—n;). 


This energy will change, even though the double 
layer itself is unchanged. 

If we add all these quantities together, (4) 
together with the first terms of (1) and (2) gives, 
because of the Schrödinger equation — (h?/2m)Ay 
+ Vy = Ey just n.E, where still the mean value 
of E over all occupied states of the electrons 
must be taken. This gives the energy Es of the 
lowest free electron level, augmented by the 
mean value of the Fermi energy, F, i.e., 


ne(Eot+ F(n./n;)) ’ 


F= (9/10) (3/27)*(R,/r.7). (5) 


Thus the total energy is: 
E(n., me) =n.Eo+ F(n./n;)) 
+ (n.?/n:) (0.6e?/r.) + (1/2n,)(ns—n.)Z®; 
—n.L(0.458e?/r.)(n./n)t+efUns/ne)tr.)] 
—eD(n.—n,). (6) 


(4) 


The derivative of E(n,n.) at ni=n. is the 
work function ¢ with the negative sign. 


— g=FEot+5F/3+1.2e?/r, — (4/3) (0.458e?/r,) 
— efr) ter, f'(r.)/3—eD. (7) 


The derivative of the sum in (6) tends to zero 
with increasing size of the crystal, and has 
therefore been omitted. If we omit the double 
layer term, and replace the Fermi energy by its 
value for free electrons, the only unknown 
quantity in (7) is Eo. One can express Eo in 
terms of the ionization energy J, and the heat of 
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TABLE I. Work functions of the alkalis. 9 
Metal Ta I+H F Peale, Pexp. 8 x . monovalent 
© divalent 
Li 3.28 7.04 2.07 2.19 2.28 x trivalent 
Na 4.00 6.25 1.89 2.15 2.25? 7 x æ quadrivalent 
2.46! 
K 4.97 5.27 1.22 2.20 2.241 3 
2.243 Ss 
2.172 
Rb 5.32 5.03 1.07 2.20 2.198 N 
2.16! 5 
Cs 5.73 4.70 .92 2.15 1.963 8 
1,87! & 
1.814 $ 
& 
ı A. R. Olpin (see reference 13). 3 
2Z. Berkes, Math. Phys. Lapok 41, 131 (1934). X 
3J. J. Brady, Phys. Rev. 41, 613 (1932). + 
4K. H. Kingdon, Phys. Rev. 25, 892 (1925). by 


sublimation, H, 


I+H= —(0/dn,:)E(n;, n.)- (9/On.)E(n;, ne) 


= — Eo— F-0.6e?/r.+0.458e?/r,te?f(r.), (8) 
which, of.course, is the result of reference 7. 
We have, from (7) and (8) 
g=I+H-3F-.0.6e?/r,+0.458e?/3r, 
—etr,f(r.)/3+eD. (9) 


Table I gives the work functions of the 
alkalis as calculated from (9) under the assump- 
tion that D=0. The Fermi energy for Li was 
taken from the work of F. Seitz," and the free 
electron values were used for the remaining 
metals. The check with the experimental values!’ 
is extremely good, considering the approximate 
nature of the theory, and shows that the double 
layer is probably small for these metals. The 
double layer may, however, give an important 
contribution to the work functions of other 
metals. 

The solid curve of Fig. 1 gives the theoretical 
values of J+H—g in electron volts plotted 
against r, in Bohr units. It is assumed that the 
Fermi energy has its classical free electron value, 
and that D=0. The light broken line connects 
the experimental values for the monovalent 
metals. Most of the discrepancy for Li (r,= 3.28) 
is accounted for by the deviation of the Fermi 

12 See reference 7, IV. 

!3 For the experimental values of the work function, see 
A. L. Hughes and L. A. DuBridge, Photoelectric Phe- 


nomena, McGraw-Hill, 1932, p. 75. For the heats of 
sublimation, see J. Sherman, Chem. Rev. 11, 93 (1932). 


25 30 35 40 45 5&0 5.5 60 
nr (Bohr units) 


Fic. 1. Theoretical curve of I+H-yg for monovalent 
metals, assuming no surface double layer. Broken line 
connects experimental values for these metals. 


energy from its free electron value.” The experi- 
mental values of ]+H-— gy for other than mono- 
valent metals are also shown in Fig. 1. A calcu- 
lation similar to the preceding one could be 
made for these metals by treating all valence 
electrons as free. The work function could then 
be obtained as the difference of quantities which 
are even greater than those for monovalent 
metals. One can obtain an orientation, however, 
by considering only one electron as free, the rest 
belonging to the “ion” core. There will then be 
two main modifications of the foregoing con- 
siderations. First, the Fermi energy will be even 
further from its classical value. It will in general 
be smaller than this if the levels of the last 
electron all belong to the same Brillouin zone; 
it may be greater or smaller if they extend over 
both sides of a gap between two zones. Second, 
a large part of the cohesion will be due to the 
interaction of the “'ions.” Thus in (9), only that 
part of the heat of sublimation should be inserted 
which results from the outermost electron. This 
correction is, indeed, in the right direction to 
bring the experimental points for these metals 
closer to the theoretical curve. 


On the Possibility of a Metallic Modification 
of Hydrogen 


E.P. Wigner and H.B. Huntington 


Journal of Chemical Physics 3, 764-770 (1935) 


(Received October 14, 1935) 


Any lattice in which the hydrogen atoms would be 
translationally identical (Bravais lattice) would have 
metallic properties. In the present paper the energy of a 
body-centered lattice of hydrogen is calculated as a 
function of the lattice constant. This energy is shown to 
assume its minimum value for a lattice constant which 
corresponds to a density many times higher than that of 


the ordinary, molecular lattice of solid hydrogen. This 
minimum—though negative—is much higher than that of 
the molecular form. The body-centered modification of 
hydrogen cannot be obtained with the present pressures, 
nor can the other simple metallic lattices. The chances 
are better, perhaps, for intermediate, layer-like lattices. 


On the Possibility of a Metallic Modification of Hydrogen 


(1) INTRODUCTION 


HE present calculation of the constants of - 


metallic hydrogen has a threefold purpose. 
First, we wanted to explore the possibility of 
preparing metallic hydrogen under high pres- 
sures.! The hope for this, however, appears to be 
small, as the pressures under which metallic 
hydrogen would be stable seem to be unattainable 
with the present techniques. Second, it seemed to 
us important to check the free electron method of 


calculating wave functions and heats of vaporiza- 


tion? in a case where experiments give an upper 
limit to the energy. The obvious instability of 
metallic hydrogen under ordinary pressure shows 
that its heat of formation is smaller than that of 
the molecular hydrogen. This condition proved 
to be only too well fulfilled. Finally, we hoped to 
obtain some information concerning the condi- 
tions for the metallic state and the line separating 
metals from metalloids in the periodic table.? 
This purpose has been achieved by the present 
work to the extent expected, but this point, 
although perhaps apparent to the reader, will not 
be stressed in the present paper. 


(2) CALCULATION 


The calculation itself was a rather mechanical 
application of the principles outlined elsewhere.’ 
First, the energy E(r,) was found for which the 
solution of the Schrödinger equation with the 
ordinary Coulombic potential —e?/r had zero 
derivative at distance r,. This was done by a 
numerical method and, in addition, checked by 
an analytic approximation,‘ which gave a direct 


1It was J. D. Bernal who first put forward the view that 
all substances go over under very high pressures into 
metallic or valence lattices. 

? (I) E. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); 
(II) 46, 509, 1934; (III) E. Wigner, ibid. 46, 1002 (1934); 
(IV) F. Seitz, ibid. 47, 400 (1935); (V) J. C. Slater, ibid. 
45, 794 (1934). E. L. Hill remarks in a ‘‘Note on the Sta- 
tistics of Electron Interaction” (Physik. Zeits. Sowjetunion 
7, 447 (1935)) that the expressions “‘parallel spin” and 
“antiparallel spin” should be replaced by ‘‘parallel spin Z 
components” and “antiparallel spin Z components,” re- 
spectively. The former expressions are not quite equivalent 
with the latter ones according to the general formalism of 
the statistical interpretation of quantum mechanics and 
can be used only as an abbreviation for them. (This is 
done in the present paper also.) We believe that Dr. Hill’s 
further remarks concerning the papers of this reference are 
based on a misunderstanding. 

3 Interesting quantum-mechanical considerations on this 
point have been put forward by F. Hund (International 
Conference on Physics, October 1934, London). 

4 To be given in Section 3. 
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expression for the wave function needed for 
subsequent work. The plot (lower curve of Fig. 1) 
shows the results of the numerical method which 
gave the more accurate results. Its values lie 
about 0.02 Ry (Rydberg units) below those of 
the analytic formula. In this case E(r,) has no 
minimum, as in case of Na, but instead continues 
to fall to —o for r,—0. In Na the rising of 
E(r,) for small r, results from the energy neces- 
sary to press the valence electron into the ınner 
shells, whereas no such shell exists for hydrogen. 

The upper curve of Fig. 1 is obtained by adding 
to E(r,) the Fermi energy for free electrons, 


(2.2099/r.2)Ry, 


where r, is expressed in Bohr units A?/e!m. This 
corresponds to the calculation adopted in ref. 2 I 
and gives a minimum of E= —1.156 forr,=1.50. 
This would give a density of 0.80 (as compared to 
0.087 in the solid molecular form!) and a heat of 
vaporization of 48.7 cal. into atomic hydrogen, as 
compared with 52.5 cal., the heat of formation 
of the molecular form. We see that the molecular 
form is the more stable, even according to this 
very approximate calculation, but if this were 
all, it would probably still be possible to trans- 
form it under pressure to a metallic form. 

Unfortunately, a great number of corrections 
must be applied to the simple picture given in 
ref. 2 I which decreases the stability of the metallic 
form in this instance. These corrections are given 
for the flat wave functions in refs. 2 II and 2 IIT. 
They are: | 


0 ! 


r — 


Fic. 1. Lower curve—the energy of the electrons without 
momentum, the upper curve—total energy as a function of 
ae constant. Both according to a naive free electron 
theory. 
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(a) The difference between the energy of the electron 
clouds (§2 in II) and the exchange energy (§3 in IJ) 
0.284/r,—0.017/r2(E.—E), (in Rydberg units), (1) 


where Ex is a mean value of the energy of the higher s 
terms and has been estimated‘ to be about 20 Ry/rè. 
The second term in (1) is, therefore, almost negligible. 

(b) The correlation energy, calculated in ref. 2 III, 
which can be represented for r.>1 rather closely by 


—0.584/(re+5.1). (2) 


(c) Acorrection for the Madelung number will be added 
in the next section. 


The energy after the corrections (a) and (b) have 
been added is plotted on a magnified scale on the 
lowest curve in Fig. 3. One sees that a great part 
of the binding energy is lost. 


(3) 


It is very probable that the wave functions 
have nearly the form e’r'=!k, not only in Na, but 
also in the other alkalies excepting Li.5 In H this 
is not the case, however, and there will be further 
corrections introduced into the energy curve 
similar to those calculated for Li by F. Seitz.” !v 
Still the situation for H is rather different from 
that for Li. In H the wave function of the 
lowest energy electron (p=0) is strongly modu- 
lated but the modulation does not increase much 
for higher p functions. In Li the wave function 
for p=0 was quite flat and the modulation 
increased with increasing $. 

For the calculation of these effects an ap- 
proximate method has been used. The equation 
for the wave function of the electron with the 
momentum p, multiplied by ei»'=!h, is® 


Hax = —Ax, — tu- grad x,.—2x,/7= Eu Xur 


3 
u=2hp/e'm. 9) 


Here the e, does not include the kinetic energy of 
the flat wave functions, p?/2m=y?/4 Ry. The 
ee, will give us the change in the Fermi 
energy, and the wave functions x, the change in 
charge distribution, which are both caused by 
the fact that the electrons are not free in the 
lattice. The charge distribution is used in a 


sCf. F. Mott and C. Zener, Proc. Camb. Phil. Soc. 
30, 249 (1934); also J. Bardeen and E. Wigner, Phys. Rev. 
48, 84 (1935); for Li see reference 2 (IV) and J. Millman, 
ibid. 47, 286 (1935). 

6 Cf. Eq. (11), reference 2 (II). The energy in (3) is in 
Rydberg units Ry and the length in Bohr units. 
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recalculation of the interaction of the electron 
clouds and the Fermi hole (exchange energy). 
We shall see that x, can be assumed to be the 
sum of a function of r alone (s part) and a 
function of r multiplied by (u-7) (p part). The 
boundary condition for the x, is periodicity in 
the lattice, which means zero derivative for the 
s part, and vanishing of the p part on the surface 
of the s sphere. 

Eq. (3) has been solved by a perturbation 
method. We shall need only those solutions of the 
unperturbed equation 


(4) 


er An: = Nnlént: 


which are functions of r alone (!=0, or the s part 
of x), and those which are functions of r multi- 
plied by (u'r), (l=1 or the p part of x). The 
boundary conditions for these £ functions are the 
same as for the corresponding part of x. We 
have? "€ 


Eoo = (3/4rr3)}, 
Eni = (3icn/uwn) (u grad) (sin wr)/r, 


Eno =Cn(SIN War) /?, 


(5) 


noo=0, Nano = Nal = Wn’, 
1.4307 1.024 
with oO) =———_, = F 
r, (2rr,)? 
(n+4)- 
wn = (n4 $)—, = 
fa (2rr,)? 


when the last two expressions hold well for n>1 
only. 

Next we must calculate the matrix elements of 
H,. All terms give nonvanishing elements be- 
tween n: and fns: only, except for iu- grad which 
gives elements between nı and £nrı+ı. We have 


(H,)oo: o0= — 3/14; 

(H,)ı0: 10 = 20.2/r,?— 5.65/r,; 

(II no: no = (Hy) mi; ni = (n+ F)’ T?/r; 
(H,)ı1: 11= 20.2/7,? — 3.66/r,; 
(H )aro: 21 = — (tnen/34) dans 

(H )oo: 10= 1.36/15; 

(Z7,)o0: no = 2(6)}/ (n+ 3) are. 


(6) 
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For »=0, this gives, by Rayleigh-Schrödinger 
formulas, 


3 (1.36)? o 24 


e= — -— c —- : 
0 r, 20.2—2.65r, n=2(n+%)'r! 
(7) 
1.36r, © 2(6)}r, 
Se eT 
20.2 — 2.657, n=2 (n+ 4)3 m? 


These are the formulae to which we referred in 
Section 2. For „0, we diagonalized the three 
rows and columns of H, referring to £0, £10 and £11 
and used only for the remaining ones the 
Rayleigh-Schrédinger method. This was ad- 
visable, because (M,)ıo; wand (H,)11; 11 were close 
together. This gave 


3 1.85 
r, 20.2—2.65r,— 6.7242(20.2 —0.67,)-! 
—0.0093. (8) 


Xo= oo + 


The mean value of e„—e with the weight u, 
from n=0 to a=3.84/r, is the correction to the 
Fermi energy; it is plotted in the lowest curve of 
Fig. 2. It is negative, just as in Li, because there 
is no p level below the s level of the valence 
electron. The energy ¢,—é€ is no longer very 
accurately proportional to u?, because the effect 
of the next Brillouin zone is appreciable. 


(4) 

Next the interaction of the electron cloud and 
the more accurate exchange energy must be 
derived. The greater density near the nucleus 
makes it evident that the first will be greater 
than the free electron value assumed in the 
previous section. This is true also for the ex- 
change energy, because the Fermi hole is more 
effective in consequence of the uneven charge 
distribution. The two effects work against each 
other, but in contrast to Li, the former is greater 
here and decreases the binding. 

The calculation of these quantities is a little 
laborious and contains nothing of principal 
interest. It will be sketched only. 

The square of the calculated wave function, 
i.e., the electric density p, was approximated by 
an analytical expression’ containing a constant 


7 The reason for this is that only for this form were we 
able to perform the integrations. 
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002: 7 


0.01 
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Fic. 2. Bottom line, correction to the Fermi energy; 
middle line, correction for the Madelung number; top line, 
correction for the interaction energy. 


and an exponentially decreasing charge distri- 
bution around each ion: 


p=pitp, pı=ell-c)/(Arr?/3). (9) 


p: itself contains two parts: first the exponential 
charge distribution around the ion, in the s 
sphere in which we are, and second the exponen- 
tial charge distribution around the other ions. 
The latter was averaged over all directions so 
that p: also became spherically symmetric in each 
s sphere 


ech? Sh 
p: = S erta =| 
87 Br 


k= e7%ro(1 4B, +6r,2/2)(Bre—1)-. 


(9a) 


The two constants c and £ were adjusted so as to 
fit the square of the calculated wave function (7) 
best. 

The potential corresponding to this charge 
distribution contains also two parts: ®,, corre- 
sponding to the constant part pı and ®; corre- 
sponding to ps which could be easily calculated, 
and hence also the interaction of the electron 
clouds. 
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For the calculation of the exchange energy the 
wave functions of the electrons with momentum 
p were assumed to be in the form e‘(»-2)/hy(r) 
where x(r) is the wave function of the lowest 
state. Under these conditions, the exchange 
energy is? II, 2 IH 


e? 1 
-= f ante) — dir, (10) 
where 
sin (r/d) —(r/d) cos (r/d)? 
an | 
(r/d)§ 


r r\ 2 
-erdı+1(-)+12(Z) | (10a) 
d d 


with d=0.52ir,. All the integrations could be 
carried out in elliptic coordinates. The result is 
evidently a quadratic function in c. The constant 
term is simply the exchange energy for flat wave 
functions (c=0), i.e., 0.916Ry/r,. The linear 
terms vanish as may be seen from the following. 
The integral of the first term in pı(r) times 
e(|r-r'|)/|r—r’| over r is independent of r’. 
The product of this quantity and the terms 
containing c in p(r’), integrated over r’, must 
vanish, since the integral of p(r’) is independent 
of c. By analogy the other linear term vanishes 
also and the sole remaining correction to the 
exchange energy of 0.916Ry/r, goes with œ. 
Hence this is a second order effect? of the 
deviation from flatness of the wave function (9), 
and smaller than the increase of the interaction 
of the charge clouds which is a first order eflect. 

Next the change of the correlation energy, due 
to the uneven charge distribution, has to be 
considered. In the case of even distribution, the 
correlation energy is certainly smaller than 
0.292e?/r,, even for a very great lattice constant. 
For the actual uneven distribution in a greatly 
expanded lattice, however, this energy is as 
follows. Hartree’s equations, solved for many 
electrons ın the field of many ions, give as wave 
functions of the electrons a sum of spherically 
symmetric maxima around every ion. If all ions 
are sufhciently far removed from each other, 


8 This differs from the case for Li. 
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these waves—let their form be «(r)—no longer 
overlap. The total interaction energy per electron 
is 


afee) |ulr)|?°4rr?dr (11) 


where (r) is the potential of the charge distri- 
bution | u(r) |?. Since no such interactions actually 
exist, Hartree’s equation gives an incorrect 
result by this amount, which will be compensated 
by the exchange and correlation energy. 

If Fock’s picture is used instead, the interaction 
energy between electrons with parallel spins 
vanishes in consequence of the enormously 
increased Fermi hole. There remains still the 
interaction between electrons with antiparallel 
spin, by which amount Fock’s picture is still in 
error. This remaining half of (11) must be 
nullified by the correlation energy. It follows 
that this energy must greatly exceed 0.292e?/r,, 
the largest value possible for flat functions. We 
see that the correlation energy is increased by the 
modulation of the waves by a greater factor 
than the exchange energy, and it becomes in the 
limiting case of an infinite lattice just one-half of 
(11), approximately {Ry per electron.? 

We have increased the correlation energy for 
flat waves, given in,?7! by the same factor by 
which the exchange energy appeared to be 
increased. The total change of the electron 
interaction energy was positive, and plotted in 
Fig. 2 also. 

Finally, the correction for the Madelung. 
number, as given in appendix 2, Reference? ™ is 
given in the same plot. The sum of all three 
corrections has been added to the curve of Fig. 3 
and the upper full line of this figure obtained. 
This curve assumes its minimum of —1.05Ry 
(heat of vaporization about 16 Cal.) for 7,=1.63. 
Bohr units, corresponding to a density of 0.62. 
The possible error in these figures is quite large, 
because the correction quantities are so great. 


(5) 


The last quantity which had to be calculated 
for the thermal constants of metallic H was the - 
zero-point energy corresponding to the half- 


° We emphasize this point since doubts have been raised 
whether the correlation energy actually exists. 
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-1.02 
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Fic. 3. Energy of the lattice as a function of lattice 
constant. Lowest curve, for flat wave functions; second 
curve with all corrections, except zero-point energy of 
nuclei; the dotted lines contain the zero-point energy, the 
lower for heavy, the upper for ordinary H. 


quanta vibration of the nuclei. If the elastic 
constants of the medium were known, the elastic 
spectrum and the zero-point energy could be 
calculated following Blackman’ or a paper of 
Barnes, Brattain, and Seitz." Since, however, 
only the compressibility can be derived from 
Fig. 3 (and this only very inaccurately, as it may 
be in error by as great a factor as two), it was 
performed by the older method of Debye." 

From the curvature of the upper curve in Fig. 
3 one obtains for the reciprocal compressibility 
of a lattice of very heavy hydrogen, 


4(¢1:+2¢12) = 2.9K 10" dyne/cm? for r,=1.5 
= 1.510" dyne/cm? for r,=1.6 
= 1.110" dyne/cm? for r,=1.7 
=0.8 10" dyne/cm? for r,=1.8. 


By analogy with other metals c1, = 2¢12 = 4¢44 was 
assumed. The zero-point energy is then about 


10 M. Blackman, Proc. Rov. Soc. London A148, 365, 384, 
(1935); 149, 117, 126 (1935). 

u Barnes, Brattain and Seitz, Phys. Rev. 48, 582 (1935). 

12: Cf. e.g., E. Schrödinger, Geiger-Scheel Handbuch der 
Physik, Vol. X. 
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0.0244 (E” P Ry, where E£” is the second derivative 
of the energy in Ry with regard to r, in Bohr 
units. For the isotropic hydrogen with mass two 
the value is 0.0173(E”) Ry. The dotted lines of 
Fig. 3 give the final results for the lattice energy, 
the upper one is for light, the lower for heavy 
hydrogen. The density of the former is 0.59, its 
heat of formation out of atomic hydrogen 10 Cal. 
We think that this figure is somewhat too low 
because the correlation energy is greater than 
we assumed it to be. 


(6) CONCLUSION 


The great density of the hypothetical metallic 
form of H is not surprising. It seems to be the 
general rule that the density of the metallic and 
valence lattice of a substance be greater than 
that of the layer lattice, and this in turn has 
greater density than the molecular lattice. Table 
I illustrates the point. 

This table makes it natural to assume that all 
atoms are bound together by shared electron 
forces in metallic and valence lattices, and that 
this is the case also for the atoms within a layer in 
layer lattices. The different layers are bound 
together, however, more weakly, perhaps by 
van der Waals forces. This is well in accord with 
the general geometric considerations of A. Reis 
and K. Weissenberg.!} It seems sensible, there- 
fore, to speak about valence-like forces between 
two atoms in a lattice, if the density of the 
outermost electrons of both overlaps considerably 
at this distance.!* 

The heat of formation of the ordinary, molecu- 
lar Hg lattice is 52.5 Cal., practically one-half of 
the heat of dissociation. Although the figures of 
the preceding section may be in error by quite a 
few calories, it is evident that the molecular form 
is so much more stable (by about 40 Cal.) than 
the metallic, that the chances of obtaining the 
latter are extremely small. 

In the ordinary lattice, there are two “nearest 
distances.” The distance between two atoms of 
the same molecule is 0.75A =1.45a while the 
nearest distance between two H atoms of 


13 A, Reis, Zeits. f. Physik 1, 204 (1920); TE (1920); 


K. Weissenberg, Zeits. f. Krist. 62, 12, 52 (19 

4 This corresponds also to the ideas developed by L. 
Pauling, J. Am. Chem. Soc. 53, 1367 (1931); J. C. Slater, 
Phys. Rev. 38, 1109 (1931). 
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TABLE I. 
Suascnes DENSITY DENSITY 

As, metallic 5.72 yellow 2.03 
Diamond 3.51 graphite 2.24 
Black phosphorus 2.70 yellow 1.83 
Se, metallic 4.82 red 4.47 
Sn, white 7.28 grey 5.76 


different Hz molecules is about 3.3A =6.3ao."° If 
one compresses the ordinary lattice, the latter 
distance will decrease, the former increase, as 
follows already from the calculations o F. 
London.! Thus, the increase of the pressure 
brings the intermolecular distances nearer to the 
intramolecular distances, and the lattice will 
become more similar to a simple lattice than it is 
at low pressures. It is possible that this goes on 
continually, and that one reaches a simple close 
packed lattice with metallic properties only in 
the limiting case of infinite pressures. 

During the increase of pressure, the total 
energy of the substance will increase by the work 
done (we assume very low temperatures). It is 
possible, thus, that the energy of the continu- 
ously compressed lattice becomes at a certain 
volume, assumed at a very high pressure, greater 
than that of another modification. In this case 
there will be a discontinuous change to the latter.!” 
Whether or not such a transition will occur into 
the metallic modification, depends on the magni- 
tude of the compressibility of the molecular form 
at high pressures. The smaller this is, the greater 
will be the pressure necessary for a certain 
change in volume, and also the energy increase 
at this compression. If the extremely high 


, ie calculated nearest distance for the metallic form 
is 1.5A. 

16 F, London, Zeits. f. Elektrochemie 35, 552 (1929); 
H. Eyring, J. Am. Chem. Soc. 53, 2537 (1931). 

17 Such polymorphic transitions induced by pressure in 
ae by P. W. Bridgman, Rev. Mod. Phys. 
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compressibility at ordinary pressures,'® which is 
2.10-® cm?/dyne, would hold throughout, the 
molecular form would be stable for all volumes. 
This is not possible, however, and the com- 
pressibility certainly decreases with increasing 
pressure, but one cannot tell, at present, whether 
it decreases sufficiently to make the metallic 
form stable at any pressure. One calculates easily, 
that even under the assumption of the most 
advantageous compressibility at high pressures, 
the pressure necessary for the transformation is 
250,000 atmos., which is outside the scope of the 
present technique. 

The objection comes up naturally that we have 
calculated the energy of a body-centered metallic 
lattice only, and that another metallic lattice 
may be much more stable. We feel that this 
objection is justified. Of course it is not to be 
expected that another simple lattice, like the 
face-centered one, have a much lower energy,— 
the energy differences between these forms are 
always very small. It is possible, however, that a 
layer-like lattice has a much greater heat of 
formation, and is obtainable under high pres- 
sure. This is suggested by the fact that in most 
cases of Table I of allotropic modifications, one 
of the lattices is layer-like,!® the other either 
metallic, valence, or molecular. The difficulties 
for such an experiment will be greatly increased 
by the necessity of the formation of a nucleus 
for the new lattice. 


18 Measured by A. Eucken for the liquid form, Ber. 
deutschen Phys. Ges. 4 (1916). 

19 Diamond is a valence lattice, but graphite is a layer 
lattice (A. W. Hull, Phys. Rev. 10, 661 (1917)). Yellow 
arsenic and phosphorus are evidently molecular lattices, 
black phosphorus a layer lattice (R. Hultgren and B. E. 
Warren, Phys. Rev. 47, 808 (1935)), and metallic As is 
also only approximately a simple lattice. The red, mono- 
clinic selenium can be said to form a molecular lattice (F. 
Halla, F. Bosch, E. Mehl, Zeits. f. physik. Chemie B11, 455 
(1930)), while the metallic modification is a thread lattice 
(A. J. Bradley, Phil. Mag. 48, 477 (1924)). The situation 
seems to be most complicated with tin. Grey tin forms a 
diamond lattice, but shows otherwise no similarity to the 
valence lattice of diamond, while the metallic lattice has a 
rather complicated structure (H. Mark and M. Polanyi, 
Zeits. f. Physik 18, 75 (1923)). 
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(1) Paysiıcs always develops in two 
directions. One front pushes forward 
towards phenomena which do not yet fit 
into the general picture, and the victorics 
on this front are marked by important 
changes in our fundamental concepts. 
On this front to-day the main struggle 
is for a better understanding of nuclear 
phenomena by the application of both 
theory and experimentation. But, apart 
from this search for new concepts, there 
is a constant effort directed toward the 
deepening and broadening of our knowl- 
edge of phenomena which, we believe, 
can be understood on the basis of exist- 
ing concepts and theories. Doubtless 
this second front is of less importance. 
It rarely leads to fundamental discover- 
ies in physics proper but supports rather 
the studies on the borderline of this 
science, such as physical chemistry and 
the applied sciences. Spectroscopy sud- 
denly changed, about six years ago, from 
the first to the second category, and not 
much later it became apparent that the 
study of the solid body belongs also to 
this second elass. In spite of this, it 
remains one of the most attractive of all 
fields, since it deals in a scientific way 
with those subjects with which we must 
deal in our everyday experience. For 
example, we are never afraid when 
dropping a key that it will fly to pieces, 
as glass would, nor do we fear that a gold 
coin will dissolve in water nor evaporate 
if left for awhile in the open air. 

X-ray studies have revealed that most 
of the solid bodies in our surroundings 
are crystalline. This does not neces- 
sarily mean that they are formed by one 
single crystal—although even this is true 
for bodies of such enormous size as ice- 
bergs. More commonly, they are poly- 


crystalline, like the metal parts of 
ordinary tools, ùe., a conglomerate of 
microscopic crystals of various sizes. 
Crystalline in this connection does not 
mean a regularly shaped body of the 
kind we see in our crystallographic col- 
lections, but only that the grains have a 
regular inner structure arising from the 
arrangement of the atoms in surprisingly 
regular lattices. Samples of such lattices 
are shown in Fig. 1. (The circles repre- 
sent the centers of atoms; the lines have 
no physical significance and are drawn 
only in order to facilitate space-vision. ) 
The region over which the regular ar- 


rangement has a certain orientation is 


called a mieroerystal and may have a size 
anywhere from .00001 mm to 1 mm or 
even more. These microcrystals, gen- 
erally possessing irregular boundaries, 
are heaped together in an apparently 
random manner to form the polycrystal- 
line body. (Very little is known about 
how the microcrystals with their differ- 
ent orientations fit and stick together. 
Some assume a separate very thin non- 
crystalline phase which ‘‘pastes’’ them 
together, but there is no definitive evi- 
dence for this. ) 

The crystalline and polycrystalline 
substances constitute by far the greater 
part of all solid bodies found in nature. 
Practically all rocks are conglomerates 
of crystals, ice is crystalline, and so are 
all metals. The grains of sand are 
minute crystals and loam also is crystal- 
line. Apart from the glasses and sub- 
stances of organic origin, like wood, 
there are very few non-crystalline solids. 

(2) A distinction not necessary in the 
case of gases or liquids must be made 
between different kinds of properties of 
solids. | 
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Evidently, the consideration of a regu- 
lar lattice is much simpler than that of 
an irregularly spaced heap of atoms. It 
is important, therefore, that many of the 
properties of a polyerystal are the same 
as those possessed by a perfect single 
crystal. These properties are connected 
with phenomena which affect the bulk of 
the material, like vaporization, fusion, 
specific gravity and compressibility. 
Our understanding of these ‘‘insensitive 


properties’’ is naturally the farthest. 


advanced, and we shall devote most of 
our attention to them. 

Unfortunately, a great many very 
important properties belong in a sec- 
ond ‘‘sensitive’’ class. The breaking 
strength, for instance, is determined by 
the very weakest part of the erystal; one 


single imperfection of certain types may 


suffice to cause rupture under a very low 
stress, a tenth or even a hundredth of 
that which a perfect crystal could sup- 
port. Fig. 2 gives a rough picture of 
how this can happen: the stress, charac- 
terized by the stress lines, concentrates 
in the neighborhood of the imperfection 
and attains values which are many times 
those in the bulk of the material. This 
highly concentrated stress can widen the 
notch and finally break the whole body. 
Thus, the parts of a solid which lie above 
and below a crack not only do not in- 
erease the strength of the material but 
very definitely weaken it. One can say 
that the strength of a solid is much 
smaller than that of its weakest part. 

The situation for the electric break- 
down of insulators parallels that for the 
elastic limit (the smallest stress which 
causes a permanent deformation), and 
the study of these sensitive properties of 
erystals involves besides a knowledge of 
the erystal in bulk, its criminology, t.e., 
a knowledge of the most usual faults and 
imperfections. 

In addition to these extremes, there 
are, of course, a number of borderline 
properties. These are partly connected 
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Fig. la. PART or A KCl LATTICE. THE SHAD- 
OWED SPHERES DENOTE THE POSITIONS OF THE 
K 10NS, THE EMPTY ONES THE POSITIONS OF THE 


-Cl rons. THE DISTANCE BETWEEN NEAREST NEIGH- 


BORS IS .00000031 mm. ORDINARY ROCKSALT 
HAS THE SAME LATTICE WITH SOMEWHAT 
SMALLER DIMENSIONS. 


Fic. lb. Part OF THE LATTICE OF AN ALKALI 
METAL. THE SPHERES REPRESENT THE CENTERS 
OF MASS OF THE ATOMS. THE DISTANCE BETWEEN 


NEAREST NEIGHBORS IS .000000372 mm IN SODIUM. 


Fig. le. UNIT CELL OF THE DIAMOND LATTICE. 

THE DISTANCE BETWEEN NEXT NEIGHBORS IS 

.000000154 mm. Si HAS A SIMILAR LATTICE WITH 

A DISTANCE OF .000000234 mm BETWEEN. NEAREST 
NEIGHBORS. 
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with the external surface, as, for ex- 
ample, the thermionic emission of elec- 
trons, or with the internal boundaries of 
erystallites, exemplified by the electric 
conductivity of compressed salts. All 
these properties are influenced to some 
extent by small contaminations. With 
extreme care and sufficient experimen- 
tal skill reproducible results are some- 
times obtainable for these phenomena, 
and they are then frequently as amen- 
able to theoretical interpretation as the 
insensitive properties. 

(3) Let us return now to the insensi- 
tive properties. Even with regard to 
these, the variety found in solids is much 
greater than that in gases. From the 
empirical point of view, four main 
classes, with many transitions between 
them, can be distinguished. This classi- 
fication, which in its essentials goes back 
to Grimm, contains: 

(a) Molecular lattices. Inert gases or 
saturated compounds like He, Ne, A, 


l t 


Fig. 2. 


etc, H,, N, Op ete, CH, C,H, H3O, 
H,S, ete, and all organic compounds 
form such crystals. They all have low 
heats of vaporization and condense only 
at comparatively low temperatures. 
They are soft and moderately brittle, 
are good insulators and are transparent, 
except in spectral regions in which the 
building molecules themselves show 
absorption. 

(0) Metals have in many respects 
properties opposite to those of class a. 
The binding forces between the atoms 
are much higher and the heat of vapori- 
zation greater, and they have an in- 
creased hardness. Their most remark- 
able property is, of course, that they are 
good conductors for electricity and heat. 
They are opaque and owe many of their 
important applications in industry to 
their plasticity ; that is to say, they break 
only after great deformations.? Their 
solubility in each other is considerable 
(alloys), but they never dissolve in sol- 
ids of other classes. 

(c) Valence lattices (diamond, quartz, 
carborundum) and 

(d) Ionic lattices (salts) are rather 
Similar types. They both have high 
heats of vaporization, strong cohesive 
forces, are transparent like molecular 
lattices, are good insulators and are hard 
and brittle. The main difference be- 
tween them is that while the former are 


formed from neutral atoms, the building 


stones of the salts are electrically charged 
ions, held together by the electrical 
attraction between opposite charges. 
They dissolve, therefore, in liquids with 
high dielectric constants like water, 
which diminish the electrical attraction 


of the ions down to a small fraction of 


its original value. 


1 This is why they do not break if dropped. 
The sudden stopping on the ground causes great 
stresses. In consequence of this, the metal will 
suffer a plastic deformation which will not 
cause rupture, however. In consequence of 
the plastic deformation the metal will act as 
its own shock absorber by allowing more time 
for the stopping of the bulk of the material. 
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This characterization of the four 
groups of solids should be understood in 
the same sense as should a similar char- 
acterization of a class of plants in botany. 
It does not give ironclad rules, but rather 
ideals from which the real cases often 
deviate; especially is this true for the 
more complicated compounds. Also 
various kinds of transitions occur be- 
tween the four groups. Sometimes in- 
side individual layers we have a lattice 
of one kind, while the forces between the 
layers are characteristic of another of 
our classes. There are also cases which 
are really transitional in all their be- 
havior between two (or even three) 
groups, especially between valence and 
ionic lattices. 

These exceptions are rare, however. 


The importance of the four groups be- 


comes most evident, perhaps, if we 
realize that instinctively we classify into 
one of these groups all solid bodies of 
inorganie origin, which happen to fall 
into our hands. The above characteriza- 
tion of the four groups is the scientific 
description of what all of us would ex- 
pect with regard to vaporization, hard- 
ness, electric conductivity and brittleness 
after some inspection and handling of 
such substances as condensed CO,, 
rhodium, carborundum and Glauber’s 


salts, even if we had never seen them 


before. On the other hand, we wouldn’t 
quite know what to expect from transi- 
tion lattices such as carbide or even 
graphite. 

(4) The enormous differences between 
the physical properties of different kinds 
of lattices make it evident that the forces 
holding the atoms or molecules together 
are very different in the four cases. In 
order to understand the origin and na- 
ture of these forces, we must first recall 
the structure of isolated atoms and mole- 
cules. This is probably well known to 
the readers of the Screntiric MONTHLY. 
It is only recently that Professor Eyring 
gave an excellent review of this subject 


in these pages.? According to Ruther- 
ford, the atom contains, first of all, a 
heavy nucleus, containing all the positive 
charge and (except for about one part in 
two thousand) all the mass of the atom. 
The center of gravity of the atom prac- 
tically coincides with the nucleus, so that 
in Fig. 1 the circles may be regarded 
alternatively as the positions of atoms or 
nuclei. This nucleus, though small, is 
full of mysteries, which fortunately are 
of no importance in understanding the 
solid state. The negative charges, which 


‘exactly compensate the positive charge 


of the nucleus of a neutral atom, are car- 
ried by light particles, the electrons. 
These electrons surround the nucleus like 
an enormous cloud with dimensions a 


-hundred thousand times that of the 


nucleus, although the cloud is itself only 
about .0000001 mm thick. Quantum 
mechanics, created by Heisenberg, 
Schrodinger and Dirac, unravelled for 
us about eight years ago the exact laws 
of motion of this electron cloud. It is 
now possible to calculate the density of 
this cloud at different distances from the 
nucleus, and from this one would nat- 
urally expect to obtain important infor- 
mation concerning the structure of 
solids, by comparing the density distri- 
bution of the electrons for different 
distances in the lattice. The outermost 
or valence electrons are responsible for 
the entire chemical behavior of the 
atoms. In Fig. 3, the full line represents 
the density of the valence electrons as a 
function of the distance from the nu- 
cleus. In addition to this, the position 
of the nearest neighbor is marked on the 
abscissa and the density distribution of 
the valence electron of this neighbor is 
plotted in the direction of the first atom 
as the dotted line. The first plot is for 
He, the most characteristic representa- 
tive of a molecular lattice; the second is 
for sodium, a typical metal; the third for 


2 SCIENTIFIC MONTHLY, 39: 415-419, Novem- 
ber, 1934. 
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Fig. 3a. CHARGE DISTRIBUTION OF TWO NEIGHBORING He ATOMS IN THE LATTICE. 


Fig. 3b. CHARGE DISTRIBUTION OF EXTERNAL ELECTRONS IN FREE Si ATOM (FULL LINE). THE 
DOTTED LINE IS THE CHARGE DISTRIBUTION OF THE VALENCE ELECTRONS OF ANOTHER Si ATOM, 
PLACED AT THE SAME DISTANCE FROM THE FIRST AS IN THE LATTICE. 


Fig. 3e. CHARGE DISTRIBUTION OF THE VALENCE ELECTRON IN FREE SODIUM ATOM (FULL LINE). 

THE DOTTED LINE IS THE CHARGE DISTRIBUTION OF THE VALENCE ELECTRON OF ANOTHER SODIUM 

ATOM, PLACED AT THE SAME DISTANCE FROM THE FIRST AS THE NEAREST NEIGHBOR IN THE LATTICE. 

FIG. 3d. CHARGE DISTRIBUTION OF EXTERNAL ELECTRONS IN K 10N (FULL LINE) AND Cl ION 

(DOTTED LINE). THE DISTANCE OF THE ZEROS OF THE TWO PLOTS IS EQUAL TO THE DISTANCE OF 
THE NEAREST IONS IN THE KC] LATTICE. 


the valence lattice of silicon and the last 
one is for KCl, which closely resembles 
ordinary rocksalt. 

We realize at once an important dif- 
. ference between the molecular and ionic 
lattices (first and last pictures) on the 
one hand, and the metallic and valence 
lattices on the other. For the former, 
the overlapping of the electron clouds is 
small, in the latter ones it is so great that 
it is impossible to tell to which atom a 
certain valence electron belongs. In the 
former cases the constituent atoms or 
ions, although attracted by their neigh- 
bors, have their charge distribution but 
slightly affected. This is not so for the 
metals and valence lattices. There is no 


me 


region between the atoms with a small 
charge density and consequently no for- 
bidden region for the electrons. The 
electrons are able to pass from one atom 
to the next. Thus the valence electrons 
move freely and are common to the whole 
lattice. This is of decisive importance 
for the properties of these substances. 

In molecular and ionic lattices, it is 
possible to consider the constituents as 
different entities. Born’s classical 
theory of mechanical electric and ther- 
mal properties, which treats the atoms 
and ions of the lattice as individuals, 
attained its great successes for these 
lattices. 

The great differences in the behavior 


be 64 


Fig. 4. 
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of the two classes are due to the different 
character of the constituents. These are 
neutral atoms in the first case, in the 
ionic lattices they are charged particles. 
The electric forces between ions are very 
strong, and this makes the cohesive 
forces, the heat of vaporization and 
hardness great. The distances between 
neighboring ions are given by the charge 
distribution, as illustrated in Fig. 3. 
The calculations of ionic radii were car- 
ried out by L. Pauling in California and 
show remarkable azreement with the 
values derived by Goldschmidt from 
observations. These lattices are always 
so constructed that the positive ions are 
surrounded by negative ions, the nega- 
tive ions by positive ones. (Cf. the NaCl 
lattice in Fig. 1.) Since opposite 
charges attract each other, there are con- 
siderable forces holding these lattices 
together. | | 

The nature of the forces in molecular 


lattices is not so simple. Van der Waals 


was the first to assume that condensation 
is caused by the same forces which are 
responsible for the deviation in the be- 
havior of real gases from the ideal gas 
laws. This proved to be true in the case 
of molecular lattices, and the laws of 
these forces have been recognized by 
London and Wang on the basis of quan- 
tum mechanics and called van der Waals 
forces. 

Of course, there is no attraction due 
to electric charges in molecular lattices, 
since the constituents are uncharged. 
And, indeed, the attraction can not be 
understood as long as we consider the 
electrons as charge clouds. But if we 
remember their corpuscular nature, we 


realize that they can form dipoles with 


the nucleus. The direction of this dipole 
will vary quickly because of the quick 


motion of the electrons. There will be 


no force, in the mean, on a dipole of con- 
stant orientation, since the attraction for 
one dipole orientation is as great as the 
repulsion for the opposite orientation— 


and all dipole directions are equally 
probable. But if two variable dipoles 
face each other, it will be possible that 
the two attractive configurations a and 


c of Fig. 4 will oecur more often than the 


repulsive configurations b and d, al- 
though all the orientations for the single 
dipoles are equally probable. London 
and Wang have shown that this is actu- 
ally the situation, and thus laid the 
foundation not only for a satisfactory. 
theory of molecular lattices but also for 
a theory of the behavior of real gases. 
Naturally, the van der Waals forces 
are much smaller than the Coulombic 
forces between ions. Thus, the cohesion 
in molecular lattices is small, the vapori- 
zation easily giving volatile substances. 
Also it is evident that these very small 
forces will be important only if no other 
stronger forces are present. Molecular 


lattices will be formed by saturated com- 


pounds and inert gases. 

(5) Fig. 3 shows us that the metallic 
and valence lattices form the more com- 
pact modification of matter, as con- 
trasted with molecular lattices and salts. 
This gives important information con- 
cerning the question of the behavior of 
solids under extremely high pressures: 
according to Bernal, who first empha- 
sized this point, they will go over into 
metals or valence lattices. A convincing 
piece of evidence for this point of view, 
which is quite independent of calcula- 
tions of charge distribution, is furnished 
by the phenomenon of allotropy. This 
is the name given the phenomenon of the 
appearance of the same chemical element 
in different ‘‘modifications’’ with widely 
different physical and chemical proper- 
ties. The ordinary, yellow (white) form 
of phosphorus forms a somewhat compli- 
eated molecular lattice. It is a good in- 
sulator, soft, dissolves in organic solvents 
and has a density of 1.88. Bridgman at 
Harvard subjected this element to very 
high pressures, and the lattice ‘‘col- 
lapsed.’’ It transformed into black 
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phosphorus, which has a density of 2.70, 
is a fairly good conductor of electricity 
and insoluble in organic liquids. And 
this is the general rule: Whenever an 
element has two allotropie modifica- 
tions, the metallic or valence form has 
the higher density. The following table 
illustrates this point: 


As, metallic „e 5.72 yelloW „um 2.03 
diamond... 3.51 graphite „nem 2.24 
black phosph..... 2.70 yellow nm 1.83 
Se, metallic m.. 4.82 red „nme 4.47 
Sn, white e 7.28  BIey „nn 5.76 — 


Caleulations made in our laboratory 
by H. B. Huntington show that metallie 
hydrogen should also exist, though only 
under extremely high pressures, and that 
it should have a density many times 
higher than that of the usual molecular 
form. 

I shall not go into detail with regard 
to the next question which naturally 
arises—the. cause of the fundamental 
difference between valence and metallic 
lattices. Although both form in the 
compact modification of matter, apart 
from the high heat of vaporization and 
boiling point, they have nothing in com- 
mon. The reason for this is deeply 
rooted in the principles of quantum 
mechanics and has been brought out but 


lately by works of Peierls and Brillouin. 
According to their investigations, it is 
essential for a valence lattice that the 
number of valence electrons be even, and 
this rule holds without exceptions. We 
owe much valuable information concern- 
ing the structure and crystal form of 
valence lattices to Pauling and Slater, 
but a review of their work would greatly 
exceed the scope of this report. 

I hope that I have succeeded in im- 
parting to the reader the impression that 
the foundations for the understanding 
of the nature of the solid state are laid. 
Still, it will require much thorough 
work, perseverance and many new ideas 
before we will be able to add the theory 
of solids as a finished story to the build- 
ing of physics and before we will be able 
to apply with success our knowledge in 
industry. | 

The progress in the explanation of the 
properties of solid bodies is due on the 
theoretical side to the newly developed 
quantum mechanics, and experimentally 
mainly to the study of crystal structure 
by x-rays. Without these tools we would 
face these problems as helplessly as we 
still face the problem of liquids where 
x-ray studies have proved less efficient 
so far. 
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It is well known that if the interaction between electrons 
ı a metal is neglected, the energy spectrum has a zonal 
ructure. The problem of these ‘‘Brillouin zones” is 
eated here from the point of view of group theory. In 
yis theory, a representation of the symmetry group ot 
re underlying problem is associated with every energy 
alue. The symmetry, in the present case, is the space 
‘oup, and the main difference as compared with ordinary 
-oblems is that while in the latter the representations 
rm a discrete manifold and can be characterized by 
tegers (as e.g., the azimuthal quantum number), the 
:presentations of a space group form a continuous mani- 
ld, and must be characterized by continuously varying 


I. 
"NVESTIGATIONS of the electronic structure 


- of crystal lattices in particular in metals, 
1ade on the basis of Bloch’s theory, led to the 
onception of the so-called Brillouin zones.! In 


* C. R. B. Fellow. 

I The existence of these zones was first noticed by 
f. J. O. Strutt, Ann. d. Physik 85, 129 (1928); 86, 319 
929); and then, independently, by F. Bloch, Zeits. f. 
hysik 52, 555 (1928); cf. also P. M. Morse, Phys. Rev. 
5, 1310 (1930). From another point of view, they were 
scussed by R. Peierls, Ann. d. Physik 4, 121 (1930). 
heir connection with x-ray reflection was first pointed 
it by L. Brillouin (cf. e.g., Die Quanlenstatistik (Berlin, 
3077. Important physical applications were given by 
. Jones, Proc. Roy. Soc. A144, 225 (1934); 147, 396 
934); H. Jones, N. F. Mott and H. W. B. Skinner, Phys. 
ev.45, 379 (1934); J. C. Slater, Phys. Rev. 45,794 (1934): 
ev. Mod. Phys. 6, 209 (1934); F. Hund and B. Mrowka, 
er. Sachs. Akad. D. Wiss. 87, 185, 325 (1935). Compare 


paranıeters. It can be shown that in the neighborhood of 
an energy value with a certain representation, there will 
be energy values with all the representations the pa- 
rameters of which are close to the parameters of the 
original representation. This leads to the well-known 
result that the energy is a continuous function of the 
reduced wave vector (the components of which are pa- 
rameters of the above-mentioned kind), but allows in 
addition to this a systematic treatment of the “sticking” 
together of Brillouin zones. The treatment is carried out 
for the simple cubic and the body-centered and face- 
centered cubic lattices, showing the different possible 
types of zones. 


spite of these investigations, which cover a large 
part of the field, it seems desirable to develop 
the theory from a unique point of view. It 
appears that taking into account special sym- 
metry properties of different lattices brings out 
interesting features of the constitution of the 
B-Z which are not evident from the existing 
general theory. These features can be dealt with 


also F. Hund, Zeits. f. tech. Physik 16, 331, 494 (1935); 
Zeits. f. Physik 99, 119 (1936). Hund’s work deals with 
those properties of the Brillouin zones which are common 
to all zones of the same lattice (as matter of fact he does 
not discriminate between different types of zones at all). 
We consider here the different types of zone separately. 
The differences between the different types are of the 
same kind as e.g. the difference between even and odd 
terms in atomic spectra. It is surprising that there are at 
all common properties of all zones but Hund has shown 
that this is the case for the more complicated crystal 
structures. 
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uniformly by the methods of group theory,? and 
we propose to take up the subject here from this 
point of view. The first start in this direction 
has been made by F. Seitz,? and we shall use his 
results extensively, though a knowledge of his 


work should not be necessary for the under- - 


standing of this paper. 

In the theory of Bloch, every electron has a 
separate wave function. This assumption is 
identical with the Hartree-Fock approximation 
method and amounts to neglecting the statistical 
correlations between electrons. If we neglect 
these correlations, every electron obeys a sepa- 
rate Schridinger equation of the type 


h? 70° Ë 0? 
-—(—+—+—) + W=Ey (1) 
2m\dx? dy? az? 


in which V contains the ordinary and exchange 
potentials of the ions and electrons. The po- 
tential V has the whole symmetry of the lattice, 
that is, the group of our Schrödinger equation (1) 
is the space group of the lattice. 

It is clear from the ordinary group theory? 
that every characteristic value of (1) belongs to 
a certain representation of the space group and 
the dimension of the representation is equal to 
the number of characteristic functions belonging 
to this characteristic value.’ Thus far the group 
theory of the B-Z is not different from the group 
theory of any other system. But while in atoms, 
molecules, etc., the characteristic values of (1) 
are well separated, the characteristic values of 
(1) for a crystal form a continuous manifold. 
There will be several characteristic values in the 
neighborhood of any one E and the representa- 
tions of these characteristic values will be said 
to form the neighborhood of the representation 
of E for this B-Z. Thus a certain topology for 
the representations must exist and it will be 
shown that part of this topology is independent 

2? Cf. eg., E. Wigner, Die Gruppentheorie und thre An- 
wendungen (Braunschweig, 1931). The first application of 
group theory to crystal lattices has been given by H. Bethe, 
Ann. d. Physik 3, 133 (1929). 


3 F. Seitz, Ann. of Math. 37, 17 (1936). 

tL. Brillouin, Actualités Scientifiques et Industrielles 
(Paris, 1933). 

s To the symmetry operations of the space group, the 
“reversal of time" (cf. E. Wigner, Gott. Nachr. 546 (1932)) 
should be added. It has been remarked by F. Hund (refer- 
ence 1) that this will often be of great importance. It can 
be omitted, however, in the case of the cubic lattices 
investigated here. 
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of the special B-Z. Even if K. 2’, be in 
different B-Z but have the same representation, 
there will be energy values neighboring £ (with 
a few exceptions) with the same representations 
as those of energy values neighboring Z’, etc. 
The investigation of the “topology” of repre- 
sentations will he essentially the subject of this 
paper, from the mathematical point of view. 


II. 


We must review next, the theory of representa- 
tions of space groups. F. Seitz? has shown that all 
space groups are soluble groups and their repre- 
sentations can be obtained according to the 
general theory for these. ë Seitz first considers 
the invariant subgroup formed by the transla- 
tions. Since these commute, the corresponding 
matrices in the representation can be assumed to 
have the diagonal form. This means that we shall 
consider such linear combinations Y, (u=1, 2, 
-+-2, where z is the dimension of the representa- 
tion) of the wave functions, which are merely 
multiplied by constant factors (‘multipliers’) 
Wut, Wy2, waa if a displacement by the three 
elementary identity periods is made. In other 
words, the matrix corresponding to the displace- 
ment by the first elementary identity period is a 
diagonal matrix with the diagonal elements wi, 
wg, -, Wat, with similar matrices for the 
representatives of the other displacements. Since 


all matrices must be unitary, |w,i] =| e,2| 
= |w,3| =1; and if one writes 

= eikzzıtkyntkzzı) 

ayaa e'tkzratkunatker:) (2) ; 


Wyu3= eitkz z3+ kyyatk 223) 


with Xu Yu 21, Xn Vo, Za X3, Ya Za the x, y, 2 
components of the first, second and third identity 
periods, the vector k is called’ “the reduced 
wave number vector.” Of course, k will be, in 
general, different for the different wave functions 
Yi, Wo, +t, Yn It must be remembered, how- 


6G, Frobenius, Berl. Ber. 337 (1893); 1. Schur, Berl. Ber. 
164 (1906). 

7Cf. A. Sommerfeld and IT. Bethe’s article in Handbuch 
der Physik, Vol. 24 (Berlin, 1933), chapter 3. Also J. C 
Slater, Rev. Mod. Phys. 6, 209 (1934). For a simple cubic 


lattice Y= y= ced) yy Ho) Ny = Se SNS = 0. Fora face 
centered lattice yi = 21 en = 22 = ya = Ja 5 25 N= = =O, 
etc. 
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ever, that the reduced wave vector k is de- 
fined by (2) only up to an integer multiple of 
a vector r of the reciprocal lattice, i.e., a vector rT, 
for which 


Px tr yi Hr: = 2rnı 


(2a) 


Y.Xatryyatr:22= lene 
Fzx3 Frys + t23 = 2rN3 


always can be added to k, without changing its 
meaning. The space of k is periodic with all 
the periods r, satisfying (2a); two reduced wave 
vectors differing by such an r are considered 
identical. If there are no essential gliding planes 
and screw axes in the space group,® one needs to 
consider, in addition to the above translations, 
rotations and reflections only. If such a trans- 
formation is applied to y,, it will be transformed 
into a wave function, say Yn ‘the reduced wave 
vector of which arises from that of y, by just 
the rotation or reflection considered. Thus the 
reduced wave vectors of the wave functions of 
one representation all arise from one another by 
the pure rotations and reflections of the group, 
i.e., the elements of the crystal class. If the re- 
duced wave vector of one y, is transformed by 
every element of the crystal class into a different 
vector, this will be true for all of them, and we 
shall have as many wave functions Yı, ***, Wa 
as the crystal class has elements. The matrices 
of the representation corresponding to rotations 
and reflections will merely interchange the dif- 
ferent y,. If there are symmetry elements which 
leave a wave vector invariant, they form a 
group which we shall call the group of the wave 
vector. So, for example, if the wave vector lies in 
the x direction, its group will contain all rotations 
around x and all reflections in planes through x. 

A wave function y, with a wave vector k 
either is left invariant under the transformations 
of the group of k, or else transformed into a new 
Ya with the same wave vector, k, however. In 
the first case there will be only one wave function 
with the wave vector k. In the second case there 
will be several of them and they will transform 


3 We mean by this that all symmetry elements can be 
considered as products of two symmetry elements, the one 
of which is a pure translation, the other a pure rotation or 
reflection. This is the case in the most important space 
groups. 


SMOLUCHOWSKI 


AND WIGNER 60 


under the transformations of the group of k by 
an irreducible representation of this group, 
which we shall call the small representation. 
These are the results of Seitz. 

Hence the representations of the space group 


-must be characterized by two symbols. The 


first gives the reduced wave vectors (or set of w) 
which occur in the representation; the figure of 
all these wave vectors forms a “‘star’’ with all 
the rotational and reflection symmetries of the 
lattice. Three such stars are given in Fig. 1 for 


Md Wa Wad 


a b c 
Fic. 1. 


a two-dimensional quadratic lattice. The second 
symbol characterizes the small representation, 
which is an irreducible representation of the 
group of one wave vector (the groups of all wave 
vectors of a star are holomorphic). If the wave 
vectors lie in general positions (Fig. la) their- 
group will contain the unit element only. In 
this case the second symbol may be omitted. 
It may be emphasized again that two wave 
vectors must be considered identical, if the 
corresponding set of w’s is the same. Thus for 
example, if the three kax;+kyy:t+.2; are all 
integer multiples of + (not necessarily of 27) 
the wave vector kz, ky, k: is.identical with the 
wave vector —k,, —k,, —k, and the inversion 
(x——x, y>-y, z= —2) always belongs to the 
group of the wave vector.? 


III. 


We now consider an energy value E with a 
certain representation D and the wave functions 
Yi, -, Wa. If we multiply one of these by 
et(xseteyytez2) where kz, Ky, Kz, are the components 


® It is in this connection that the time reversal is impor- 
tant (cf. F. Hund, reference 1). If the crystal class does not 
contain the inversion, Fr, ky, ka will still be carried over into 
— kz, —ky, —kı by the "time reversal.” Since, as we shall 
see, the above consideration determines the surface of the 
B-Z, this will be fundamentally affected by the operation 
of time reversal. 
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of a very small vector, it will have the wave 
vector K+x and belong to a new representation 
D’. The set of new representations obtained in 
such a way will be called the neighborhood of D. 
It is clear that there will be near E, an E’ with a 
representation D’. For if y, satisfies (1), 
pyetazetayytet) = WY,’ satisfies 


h? h?i ð ð ð 
( —— â+ v) W+- (+ Ky—+ ) a 
2m m\ dx dy 0 
. h? 
= (Erw ++) yr. (3) 
2m 


In this equation the second term is small, and 
its negative value may be treated as a perturba- 
tion. Performing the perturbation calculation, 
we shall obtain a characteristic value E’ of (1) 
which is near E and the wave function of which 
will have the same translational symmetry as yı', 
since both the original operator in (3), and the 
perturbation 


A?i ð ð ð 
De ( Kz—-+ Ky—+ e~) 
Ox 02 


© 3; (3a) 


have.the whole translational symmetry of the 
lattice. 

This is all the general theory we need. If E 
had a star of the general type, the star of E’ 
also will be of the general type and our result 
merely states the well-known fact, that the 
energy is a continuous (and even differentiable) 
function of the components of the wave vector. 
The set of all energies and wave functions which 
may be obtained from one single energy level 
continuously by this operation, never touching a 
point in which the star degenerates, is properly 
defined as one Brillouin zone. The restriction to 
such representations, the stars of which are of 
the general type, is necessary for the definition 
of a Brillouin zone, since, as we shall see, two 
or more Brillouin zones may stick together for 
degenerated stars (as those in Figs. 1b and Ic). 

If we consider an energy value, the wave 
vectors of which are left invariant by some of 
the rotation or reflection operations, the situation 
still will be left essentially unchanged, if no two 
wave functions have the same reduced wave 
vector (the same multipliers). If, however, two 
or more (say s) wave functions have the same 
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wave vector, and we choose kz, xy, x, in such a 
way that the new wave vector (k+x) has the 
general position, there will be s orthogonal wave 
functions, with the wave vector k+x and with 
energies near E. Since for general wave vectors 
it never happens that two wave functions with 
the same wave vector belong to the same energy 
value, we. must conclude that they all belong to 
different B-Z which are very close for small « 
and that for the original energy value E these s 
B-Z “stick together.” The sticking together will, 
therefore, always occur for such wave vectors 
which are left invariant by some symmetry 
operations.!® 

We must investigate two more cases. First let 
x be such a vector that k+x still has the group 
of k. In this case, the small representation of E is 
equivalent to the small representation of F’. 
Otherwise the wave functions would have to 
change abruptly even for a small change of k. 
The sticking together will be the same along 
symmetry elements. 

In the second case, the group of k+x is only a 
subgroup of k, but still contains more than the 
identity. This case occurs, for instance, if we 
pass from a symmetry axis to a symmetry plane 
through this axis, or from the vector k=0 toa 
symmetry axis. The small representations of E’ 
will be irreducible representations of the sub- 
group, and if the small representation of E is 
not irreducible as representation of the group of 
k-+r, the B-Z which stuck together for k will be 
partly separated for k-+x. The small representa- 
tions in these B-Z will be, for k+x, the irre- 
ducible parts of the small representation of the 
group of k. 

The proposed characterization of a B-Z is 
given, hence, by the small representations of the 
groups of all wave vectors, which have a group 
greater than unity. For wave vectors lying in 
equivalent symmetry elements, the small repre- 
sentations are equivalent, and for a symmetry 
element which is a subgroup of another, the 
small representation must be contained in the 
small representation of the latter. Wherever 
the small representation is s dimensional, we 
have a sticking together of s B-Z, all of them 
having this same small representation for the 


10 Including the time reversal. 
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symmetry element under consideration. Again, 
it is important to remember that the group of 
the wave vector for which 


ki tk, yitkziı=nım (fori=th) (4) 


holds for one 7, say +=1%, contains all elements 
which transform k in such a way, that 
(kaxitkyy:+k.2;)/m7 remains an integer for 1=1, 
and is unchanged for the two other 7, since the 
corresponding wave vectors are all the same. 

The argument which shows that the small 
representation will be the same all along a 
symmetry element, breaks down for such points 
in which two B-Z touch each other, if no such 
touching is required by symmetry considerations. 
In the case that the energy for a certain value of 
k is the same in two B-Z, without this being the 
result of the symmetry, we speak of an accidental 
degeneracy.!?® In points of accidental degeneracy, 
the small representations of the two B-Z may 
be interchanged, but the case of such an acci- 
dental degeneracy is explicitly excluded from the 
following considerations. One can see that it 
does not occur for very large lattice constants, 
though it may occur for the actual ones. 

In the following sections, these results will be 
applied to the three most important cubic 
lattices, the simple, the face-centered, and the 
body-centered cubic lattices. Since, for instance 
all small representations of wave vectors in the 
fourfold axes are the same, this small repre- 
sentation will be called “the representation along 
the fourfold axis” and a similar notation will be 
used for the other symmetry elements. 

It has been pointed out by J. C. Slater! that 
the energy as function of k should be considered 
as a periodic, multivalued function, the periods 
being the vectors of the reciprocal lattice. The 
“discontinuities” then arise from considering for 
some k one, for other k other branches of this 
multivalued function. In our way of talking, 
the periodicity is expressed by the fact that two 
wave vectors differing by a vector r of the 
reciprocal lattice, are considered identical. It is 
convenient to single out from all sets of 
“identical” vectors one (generally the shortest), 
and not to consider the rest at all. The manifold 


10. The case of an accidental degeneracy will be treated 
in a paper by C. Herring, to appear shortly. We wish to 
thank Mr. Herring for interesting discussions on this 
subject. 


SMOLUCHOWSKI 


AND WIGNER 62 


of these “reduced wave vectors” forms the inner 
of the B-Z, their boundary in the k;,k,k, space 
(where the discontinuities are Aesumed ordi- 
narily) forms the surface of the B-Z. 

The energy as function of k has, furthermore, 


‚all the symmetry of the (reciprocal) lattice. 


This is clear, since wave functions with all the k 
of a star belong to the same representation, and 
have the same energy, hence. 


IV. 


We want to consider the effect of the time 
reversal, first. This transforms k into —k. Thus 
—k is always in the star of k, even if there is no 
inversion center present: the energy as function 
of k is always equal for k and —k. Just as for 
x-ray reflection, the inversion is always added to 
the symmetry of the problem." 

For a triclinic lattice, for instance, this means 
that the derivative of energy with respect to k 
is zero in the middle of the faces, edges and at 
the corner points of the B-Z, i.e., for 


k:xıtkyıtkza;=nır (i=1,2,3). (5) 


One can see directly also, that the group of these 
k contains the time reversal and the wave 
functions are real, hence. Thus the average 
value of the perturbation operator (3a) vanishes 
for these wave functions and the energy change 
goes with x. 

This cannot be claimed, however, for all the 
surface of the B-Z, i.e., for points for which only 
one of Eq. (5) is satisfied. The derivative of 
energy with respect to k will not vanish in these 
points and they will not really form the surface 
of the Brillouin zone.!? 

According to the program of section III, we 
shall determine now the small representations 
and their connections in the different types of 
B-Z for the simple cubic, body-centered, and 
face-centered cubic lattice. We shall begin with 
the simple cubic lattice, although no metal with. 
this structure is known. 


1 Cf. G. Friedel, Comptes rendus 157, 1533 (1913). For 
: I critical discussion of Friedel’s rule, cf., however, 
P. P. Ewald’s article in Handbuch der Physik, Vol. 
23/2 (Berlin, 1933). 
12? [t is not always true, thus, that the & for which 
dE/dk=0, are those for which the Bragg conditions are 
satisfied. 
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Simple cubic lattice. Here the surface of the 
B-Z is a cube as represented in Fig. 2, with the 
cube edge 2r/d. The inner symmetry elements 
are: the center T, the threefold axis A, the fourfold 
axis A, the twofold axis Z, the symmetry planes 
AZ, ZA and AA. The simplest way to obtain the 
group of a wave vector ending on the surface is 
to draw in all equally long wave vectors which 
are “identical” with it. The group of the figure 
constructed in this way is the group of the wave 
vector. For the arbitrary vector of the surface r/d, 
ky ku, for instance, the figure contains the 
vector — r/d, ky, kı, and the group: of the wave 
vector is, hence, the symmetry plane k,k.. 
Similarly, for the point T, there are four vectors 
ır/d, £r/d, k, and the group contains the four- 
fold axis k, and all the symmetry planes through 
it. It is holomorphic with the group of the wave 
vector ending at A which contains the four- 
fold axis kz and the symmetry planes through 
this. The group of S is holomorphic with that of 
X; that of Z contains the symmetry planes k,k, 
and k,k, and the rotation by r about k,. R has 
the full cubic group like T; M has the group of T 
and, in addition, the symmetry plane ksk, 
X has the same symmetry. 

The tables? give the characters of the irre- 
ducible representations for the groups of the 
wave vectors designated in the upper left corner. 
The corresponding representations will be the 
“small representations” characterizing the B-Z. 
The upper right corner contains the group 
elements. E is the identity, its character will be 

13 The representations of most crystallographic groups 
were given already by H. Bethe, loc. cit., reference 2 


All of them are given in E, Wigner, Gött. Nachr. (1930), 
p. 133. 
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TaBLe I. Characters of small representations of T, R, H. 


DRIE 32 60 6C2 80 J BIC 6JCi 6JC: BIC 
Mm Ji i 1 1 1 1 1 i 1 1 
T: ji 1 —1 =f t 1 Lo =l =l 1 
T |2 2 0 0 —i 2 2 0 0 —I 
Ts’ |3 —l 1 —i 0 3 —1 1 —i 0 
Ts | 3-1 -ı 1 0 3-1 -1 1 0 
refi 1 1 4 1i =i =i -1 -1 -1 
r? | 1 1 —1 -1 1-1 —i 1 1 —1 
Tiz |2 2 0 0-1 —2 —2 0 0 1 
D's |3 —1 1 -1 0 —-3 1 —1 1 0 
Is |3 —1i -i 1 0 —3 1 1 —1 0 


4,T E Ce 2C: 2704 2JCı 
A | 1 1 1 1 1 
Az | 1 1 =1 1 —1 
A,’ | 1 1 —1 —1 1 
Ay’ | 1 1 1 —1 —1 
As | 2 -2 0 0 0 


the dimension of the representation. C3 is the 
threefold axis; C4, the rotation by +r/2 about 
the fourfold axis; C,?, the rotation by r about 
the same axis; and Cz is the rotation about the 
twofold axis; J is the inversion. JC, is the 
product of J and C,, etc. JC. and JC 2 are the re- 
flections in the symmetry planes perpendicular 
to the twofold and fourfold axes, respectively. 
The figures before the symbols of group elements 
denote how many group elements of that kind 
are present in the group. The lower left corner 
gives the notation to be used to designate the 
small representation in question; it is always 
given for one of the wave vectors only, as, for 
instance, for Tin Table I. The small representa- 
tion of the wave vector R which has the same 
character as T'ır’ will be designated by Ri’, etc. 
The lower right corner contains the character of 
the group element above it, for the representation 
to the left. 

In order to save space we have included in the 


TABLE III. Characters for the small representations of A, F. 


A, F E 2C3 3JCı 
Ay l 1 1 
Ao 1 1 —] 
Az 2 — 1 0 
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TABLE IV. Characters for the small representations of È, S. 


X S E C: IC? JC: 
zy i 1 1 1 
Ye 1 1 —1 —1 
23 1 —1 —1 1 
Zu 1 | 1 —1 


tables some wave vectors (H and F) important 
for the body-centered lattice only. 

For the points dealt with so far, it was 
sufficient to denote the group elements by the 
symbols C2, C3, etc., all the rotations about two- 
fold axes being in the same class and having 
the same character in all representations. But 
for the point M, the rotation by m about the 
fourfold axes kz, k, is not equivalent to the rota- 
tion about the fourfold axis, k., which is per- 
pendicular to the wave vector. The latter will be 
denoted by C&L. Although the groups of the 
wave vectors ending at M and at X are holo- 
morphic, the element in the second group which 
corresponds to CL of the first is the rotation by 
m about kz, which is the axis parallel to the wave 
vector TX. It will be denoted by C711. 


TABLE V. Characters of small representations of M, X. 


M |E 20C} C#ı Wan 2C: J WC JCal 2JCıL ICs 

X JE Wea Ctl 2G) 2Cı J WCAL JCA 2JC A1 2JC3 
Mı j1 1 1 1 1 1 1 1 1 1 
Mı li -1 1-1 —1 1 1 1 —1 —1 
Mi {1 -1 1 —1 1 1 —1 1 —i1 1 
M, {i1 —i 1 1 —1 1 —1 1 1 —i 
Mil 1 1 1 1-1 -1 -1 -1 —1 
M,'}1 1 1-1 -1 -1 -1 —i 1 1 
M’|i —1 1 —i 1 —1 1 -i 1 —i 
M'i —1 1 1 ~1 -1 1 -1 --1 1 
M, | 2 0 —2 0 0 2 0 -2 0 0 
M; | 2 0 —2 0 0 —2 0 2 0 0 


’ 


2 E Ca? JC? ICUL 

GK, U E Cs JC? JC: 

D E Ca JC: ICL 
Zi 1 1 1 1 
2: 1 1 1 —1 
Z3 1 —] —1 1 
2 1 —1 1 —1 


This finishes the investigation of the symmetry 
axes in Fig. 2, and there remain only the sym- 
metry planes. A somewhat closer inspection will 
show, however, that the small representations 
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TABLE VII. Compatibility relations between T and A, A, 2. 


rm T2 Tır Tıs Ts 
A Ag Â1å2 Ar âs Ark; 
Ay Aa As Arka AIÀ; 
2 % 2:24 L234 DTP IEP ar} 
r’ ry’ Ti? Tis Tzs 
Ar’ A,’ A'A Aıds Ads 
Ag A As Aıksa AoA; 
Ze 23 gig 212324 2 Dod 


Mı Mı M3 Mı MY ar’ My Me Mi MV 
£ D; & Zu Za 2; Z2 23 2223 2124 
Zı 2 23 Z3 Zr Ze Zı Zi Lely L123 


Tr To TY W UN T h NT Ts 


TABLE IX. Compatibility relations between X and A, Z, S. 


Xı X: X3 Xı Xr X? XY XV Xs Xy 
Aı & A! & A A A X As As 
Z Z Z Zs Zz Z2 Z3 £3 Zə% Zi 
Sı Sı Sı S; S: S3 Se S3 S283 SS 


prevailing on the symmetry axes already de- 
termine the representations for the symmetry 
planes, i.e., they determine whether the wave 
function will remain unchanged or assume the 
negative value, if reflected in one of the sym- 
metry planes. 

A B-Z must be characterized by one each of 
the following 10 symbols: T, A, A, 2, R, T, M, 
S, X and Z. If there is an accidental degeneracy, 
however, the representation may change on an 
axis, etc. Not all the combinations of symbols 
correspond to possible B-Z. The small repre- 
sentation A on the fourfold axis must be con- 
tained in the representation T of the center, 
if this is considered as a representation of the 
group of A, and similar conditions exist between 
all pairs of adjoining symmetry elements. Table 
VII shows with which A, A, È, a certain T 
can be combined in the symbol of a possible B-Z. 
The compatibility relations between R and T, 
A, S, are the same as those between T and 
A, A, 2. (Table VII.) These compatibility 
relations reduce considerably the number of 
possible types of B-Z. In addition to these com- 
patibility relations, there are others originating 
from the four sets of wave vectors characterized 
by k.=0; kz=k,; ky=k,; k.=r/d. Every wave 
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vector satisfying one of these equations, has a 
group consisting of a symmetry plane, and the 
corresponding wave function will belong either to 
the symmetric, or to the antisymmetric repre- 
sentation of this group. This representation must 
be contained in the small representations of the 
axes lying in this plane. Table X gives, under +, 


TABLE X. Compatibility relations on symmetry planes. 


SYMMETRY PLANE + - 
PP) Lows 
k,=0 AAAs Ay’Ao/As 
iZs 224 
2123 Lara 
k,=k,>k, AyA3 AgAs 
T1T2'T; T:Tı Ts 
AA; AoAg 
ky =he <he KR SSi 
Ay, Ae’As A2Ay’As 
SiS4 2S3 
ke = r/d Tı T: Ts Ti’ Ty Ts 
124 223 


those representations along the axes, which are 
compatible with the symmetric representation in 
the plane, and under — those which are com- 
patible with the antisymmetric representation. It 
shows that, for instance, 2, is incompatible with 
Ar, Ar’, Za, Zi, Ao, T>, Tr. 

As an example, we may consider the three B-Z 
which stick together at k,=k,=k,=0 having for 
this wave vector the representation Tas. These 
three B-Z will be separated along the twofold 
axis, having there the small representations £,, 
Ne, Xa, respectively (Table VII). We may con- 
sider the one with &,. This necessarily goes with 
As along the fourfold axis (Tables VII and X), 
and sticks together with one of the other zones 
there. Along the threefold axis it may have one 
of the two representations A, or As. We shall 
assume that it has Ao. For R, we still have the 
choice of Re, Ris’, Ri’ or Res. We shall choose Ro. 
This requires, then, S, and Tz, and hence Z,. 
According to Table VIII, it will have M;’ and 
according to Table IX, X;. Its whole symbol 


will be TPosSoAsAoRis7e5sZ4A1,'X;,’ and we see 


that most small representations were uniquely 
given by the compatibility tables and the 
previous choices. 

We believe that the above description of B-Z 
for the simple cubic lattice is complete from the 


point of view of symmetry. We are well aware, 
of course, that many of the types which are 
possible geometrically will not be important 
physically, since they have, for example, too 
high energies. It appeared to us, however, that 
for the sake of clarity a complete geometric 
discussion should be given once for a simple 
case. 

The construction of the compatibility tables is 
very easy. If one is interested, e.g., in the com- 
patibilities between 2 and M, one considers for 
M the characters corresponding to elements 
which are contained in Z. These elements are 
E, Ca, JC?L, JC, (one must take JC/L, not 
JC, since the latter are the symmetry planes 
k.k,, kyk: which do not occur in the group of 2). 
The corresponding characters in Ms, for instance, 
are 2, 0, —2, 0. One sees that this is the sum of 
the characters of 2, and Z; and these are, conse- 
quently, compatible with M,;,. Thus it will not 
be necessary to give the compatibility relations 
for the other lattices explicitly. 


VI. Bopy-CENTERED CuBic LATTICE 


The shape of the surface of the B-Z is self- 
evident in the simple cubic lattice but not in 
the body-centered lattice. The identity: periods 
can be taken as three space diagonals, with 
coordinates 1/2d, +1/2d, +1/2d. The shortest 
vectors of the reciprocal lattice are the face 
diagonals, with coordinates 0, 427/d, +2r/d; 
+2r/d, 0, +2r/d; +2r/d, 427/d, 0. Since the 
inner of the B-Z should contain only different 
vectors k, the addition of a vector of the re- 
ciprocal lattice to a k lying inside the B-Z must 
lead to a vector in the outside. This is most 
simply accomplished by choosing the rhombodo- 
decahedron of Fig. 3 as the surface, in which 
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opposite faces just differ by a vector of the re- 
ciprocal lattice. The distance TH is 2r/d.'* 

The symmetry elements in the inside of the 
B-Z are the same as in the simple cubic lattice, 
r, A, A, 2, and the compatibility relations 
between these are also maintained. The point H 
has, however, the full cubic symmetry, since the 
vectors of the reciprocal lattice transfer it to 
all the end points of the coordinate axes. The 
point P is identical with three similar vertices, 
forming a tetrahedron. 


Tanit XI. Characters for the small representations of P. 


rP E 3Ce 8C3 6IC4 6IC: — 
Pı 1 1 1 1 1 
P, 1 1 1 —1 —1 
P; 2 2 —1 0 0 
Pı 3 — i 0 —Í 1 
P; 3 —i 0 1 —1 


Tapir XII. Characters for the small representations of N. 


Im 
u 
ty 


Ca? Call JC JCal JCM 


|I 


| 
| 


2 
pomt mei, pe pomt omo pan Demi peace 
| 
puuh mme pumi tna p Zee p evel 
l 
pain 
| 
— madd 
io] 
bub mel h p pb manly p pee 
II I 
Dodie poh pb par pn ponde jan pand 


The small representations for the other points 
were already given in the previous tables. We 
shall not give the compatibility relations be- 
tween axes and points, since they are easily 
obtained by the method outlined in the previous 
section. It may be mentioned that the group of 
the vectors ending in a general point of the 
surface is the symmetry plane JC:. The following 
relations are analogous to those of Table X. 


Tante XII. Compatibility relations for symmetry planes. 


SYMMETRY PLANE + = 


k,=0 Lg, âA, GiG; &,%2, âA As, G:G; 


ke=k,>k: M125, AAs, DiDa 2224, AzAs, DD, 
k,=k. <k: Ara, âA, FiF AgAs, Azdı’ds, FaFa 
k,+k,=2r DD, FF;, GG; D2Ds, FP; F3, GG, 


N The vectors k in the inside of the B-Z are transformed 
under this choice again into vectors in the inside by every 
symmetry element. 


Since the surface of the B-Z is a symmetry 
plane, the derivative of the energy perpendicular 
to this plane is zero on the surface. 


VII. FACE-CENTERED CuBIC LATTICE 


The B-Z of the face-centered cubic lattice have 
a rather complicated structure. The reciprocal 
lattice is the body-centered lattice, the shortest 
vectors of which are the space diagonals with 
components +2r/d, +2r/d, +2r/d. If we 
assume the inner of the B-Z to be bounded by 
the octahedron with the 8 planes +x+y-&2 
=3/d, then no wave vectors of the inside will 
differ by one such vector. Nevertheless, some of 
them will be equivalent, differing by the sum 
of two shortest vectors of the reciprocal lattice, 
+4r/d,0,0;0, +47/d,0;0, 0, +4r/d. In order to 
exclude these, one must cut off the corners of 
the octahedron by planes parallel to the coordi- 
nate planes at the distance +27/d from these. 
The resulting figure is the well-known truncated 
octahedron of Fig. 4. With this choice of the 
surface of the B-Z, every wave vector of the 
inside will go over into a wave vector of the in- 
side by all the symmetry operations. This 
requirement, however, which determines the 
whole shape of the surface for the simple cubic 
and body centered cases, fixes the surface here 
only at the truncating planes, but not at the 
octahedral planes. One could, for instance, bulge 
out in all octahedral planes the part which is 
shaded on one of the planes in Fig. 4 and bulge 
in by an equal amount the unshaded regions. 
The resulting surface would still satisfy all 
requirements. The truncating planes, on the 
other hand, cannot be deformed. If we pushed 
out a point on the k, =22/d plane, we would have 
to push in the corresponding point on the 
k:= —2r/d plane. After this, however, the re- 
flection on the k,k, plane would carry over wave 
vectors of the inside to the outside of the surface 
of the B-Z. . 

The B-Z is always uniquely determined if there 
is a symmetry plane perpendicular to the vector 
r of the reciprocal lattice,!5 which generates that 
part of the surface. In this case the surface lies 
at the distance r/2 on both sides of the symmetry 
plane. This was true for the vectors parallel to 
the coordinate axes which generated the surface 

8 Cf. Eq. (2a). 
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for the simple cubic lattice, it was true for the 
vectors parallel to the face diagonals in the body- 
centered structure and it is true for the vectors 
generating the truncating planes in Fig. 4. The 
situation for the octahedral plane of Fig. 4, how- 
ever, is similar to that for the triclinic lattice and 
will be shown to have similar consequences. 

Although the surface of the B-Z is thus left 
undetermined by general requirements, it is 
certainly allowable to assume it to have the shape 
of Fig. 4. . 

In the inside of the B-Z we have again the same 
situation as for the simple cubic lattice with the 
same compatibility relations holding between 
the small representations of T and the two-, 
three- and fourfold axes. This. also applies to the 
points X, Sand Z on the cubic plane. The point 
W is identical with three other points of the 
surface, two of which are shown on the figure, 
while one at the bottom is hidden. The small 


TABLE XIV. Characters of small representations of W. 


w E Ce 2C3 2./Cı 2JCa 
W, 1 1 1 1 1 
W,’ 1 1 1 1! —1 
W, 1 1 —Íİ 1 —Í 
W;’ i 1 —1 —1 1 
W; 2 —2 0 0 0 


TABLE XV. Characters of small representations of L. 


L E 2C3 3C: J 2ICs 3JCı: 
Lı 1 1 1 1 1 1 
In i 1 —1 1 1 —1 
Ls 2 —i 0 2 —1 0 
Li’ t 1 1 -Í -í — {Í 
L,’ l 1 -1 —1 1 1 
Ly' 2 —i 0 —2 1 0 


representations for the points K and U were 
given in Table VI. Z is identical with its antipode. 
The points Q on the line LW cannot be moved in 
or out. They belong to the surface, since they 


are carried over into themselves (i.e., into the 
“identical” point on the opposite face) by the 
twofold axis bisecting the Z and —X axes. 
The wave function of the wave vector ending at 
Q will be either symmetric or antisymmetric 
with respect to this rotation. In the former case 
it is compatible with Li, Li’, La, Ly’ on one side 
and with WiW,’Ws on the other. If it is anti- 
symmetric, it is compatible with Le, Le’, Ls, Ls’ 
and W2, W.’ and Ws. | l 

The group of the points on the lines LK, KW, 
LU, UW contains only the symmetry plane on 
which they lie, they have no additional symmetry 
owing to their position on the surface. This is 
natural, since the surface can be shifted away 
from them. The Compatibility Table X holds for 
kz=ky>k, between 2 and A, but there is nothing 
to replace T, and T must be omitted also from 
the last section of (k, = r/d) of this table. The rest 
of the table remains valid, however, and should 
be supplemented by the compatibilities just 
given, owing to the symmetry of the point Q. 

The surface of the B-Z at the octahedral 
planes cannot be chosen in such a way that the 
“identical” point kz—2r/d, k,—2r/d, k,—2r/d 
to every point of the surface could be reached by 
a symmetry operation also. This has the conse- 
quence that the derivative of the energy per- 
pendicular to the somewhat arbitrarily chosen 
plane octahedral face will only vanish on the 
diagonals (LW) corresponding to the separating 
lines between shaded and unshaded regions. On 
the other hand, it will have the consequence also 
that the energy for k,=#/d+u, k,=r/d—n-v, 
k,=m/d+v will be equal to the energy for 
— a/d+n, —xr/d—p—v, -r/d-+v and, because 
of the twofold axis, also equal to the energy for 
r/d—v, r/d+p+v, +/d—p. The energy as func- 
tion of k will be symmetric with respect to the 
line LW on the surface and, hence, will have 
on the octahedral surface, a sixfold rotational 
symmetry. 
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According to Herzfeld, the “chemical constant” of the electron gas is different in the high 
temperature region, where all actual measurements are carried out, from its value at very low 
temperatures. Hence the value of the constant A in Richardson’s equation must be different at 
high temperatures from the well-known low temperature value of 120 amp./cm? °K?. The 
present paper gives a calculation of the high temperature value in terms of thermodynamic 


quantities and a numerical estimation of these. 


1 


T is well known that the constant A in 
Richardson’s formula! for the thermionic 


current 2 
i=AT?eW'RT (1) 


(W is the measured work function) should have 
under ideal conditions the value 120 amp./ 
cm? °K?, Under “ideal conditions” we mean that: 

(a) The reflection coefhcient r for electrons 
should be zero, i.e., all electrons striking the 
surface should be absorbed by the same. 

(b) The work function W should be constant. 
The work function is defined as the energy 
necessary to remove one electron from the top 
of the Fermi distribution and bring it to a point 
near the surface, but still to a distance from it 
which is large as compared with the lattice 
constant. The “top of the Fermi distribution” 
again means the energy of that electron which 
has the highest energy if all electrons are in the 
lowest possible state. This is the work function 
which is measured experimentally by the photo- 
electric effect if Fowler’s method? is used for 
the evaluation. 

(c) That the electron gas in free space should 
behave like an ıdeal gas. 

(d) Finally, the surface must be uniform and 
thus free from patches.and contaminations. 

This last condition and its effects have been 
carefully analyzed by J. A. Becker.? It seems 
that it can account in the case of the most 
careful experiments, and for low accelerating 
potentials for small deviations of A from 120 
1 For the history of Richardson’s discovery cf., e.g., W. 
Schottky, Handbuch der Experimentalphysik, Vol. XIII 
(Leipzig, 1928), or A. L. Reimann, Thermionic Emission 
(London, 1934). 


2 R. H. Fowler, Phys. Rev. 38, 45 (1931). 
3]. A. Becker, Rev. Mod. Phys. 7, 95 (1935). 


only, and generally only for a lowering of this 
value. Under the conditions (a)-(d), (1) is a 
consequence of the principle of detailed balance 
and general quantum statistical laws.* 
Condition (c) is fulfilled for not too high 
temperatures® and will be assumed further on. 
If the reflection coefficient is different from zero, 
we should only expect an additional coefficient 
1—r in (1). Since 1—r is certainly in the order 
of magnitude of 1 and smaller than this, the re- 
flection can explain only a deviation of A from 
120 by such a factor. In spite of this, the 
experimental values’ are in most cases rather far 
from 120. It must be emphasized, however, that 
these experimental values are still subject to 
comparatively large variations and in many 
cases hardly reproducible. They are most trust- 
worthy if the work function is measured photo- 
electrically also. It 1s only too common an 
experience that a new set of experiments changes 
the measured values of both A and W con- 
siderably, generally both in the same direction, 
which indicates that not so much the measure- 
ment of 7, as that of its temperature coefficient, 
is crucial. In spite of this, it is generally accepted® 


4Cf. reference 1 and K. F. Herzfeld, Phys. Rev. 35, 
248 (1930), also J. H. Becker and W. H. Brattain, Phys. 
Rev. 45, 694 (1934). 

5Cf. M. v. Laue, Jahrbuch der Radioaktivität und 
Elektronik, Vol. 15, 205, etc. (1918) and W. Schottky, 
reference 1. 

6 L. Nordheim, Zeits. f. Physik 46, 833 (1928); Proc. Roy. 
Soc. Al21, 626 (1928); R. H. Fowler and L. Nordheim, 
Proc. Roy. Soc. A119, 173 (1928) show that the reflection 
coefficient is very small in general, around 0.07. See also 
Langmuir and Jones, Phys. Rev. 31, 401 (1928). 

7 Cf. A. L. Reimann, reference 1 and A. L. Hughes and 
L. A. DuBridge, Photoelectric Phenomena (New York, 1932). 
For Cb, H. B. Wahlin and L. O. Sordahl, Phys. Rev. 
45, 886 (1934), for Rh unpublished data of H. B. Wahlin. 

8 ] am much indebted to Professor L. A. DuBridge for a 
discussion on this point. Cf. also his monograph, Actualités 
Scientifiques et Industrielles, No. 268, Paris, 1935. 
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On the Constant A in Richardson’s Equation 


CONSTANT A IN RICHARDSON’S EQUATION 


TABLE I. Measured values of the constants in Richardson's 


equation. 

A wW A w 
Cs 162 1.81 | Ta 60 4.12 
Ba 60 2.11 | Hf 14.5 3.53 
Ni 1,380 5.02 | Th 70 3.38 

Pd 60 4.98 | Re 200 5.1 
Pt 17,000 6.27 | Cu? 65 4.33 
Mo 4.15 | Ag? 0.76 3.56 
W 60 4.53 | Au? 4.32 
Zr 330 4.12 | Cb 57 3.96 
Rh 35 4.81 


that the deviations of A from 120 are real and 
must be explained by the breakdown of assump- 
tion (b). The present paper will be devoted to a 
discussion of (b). 


2 


The consequences of the temperature variation 
of the work function have often been analyzed,’ 
especially by Bridgman and Herzfeld. I shall 
follow the latter’s treatment quite closely. The 
reason this subject is taken up again, is that our 
somewhat improved knowledge of the metallic 
structure allows a somewhat closer analysis 
which shows that in addition to the effect, 
analyzed by Herzfeld (which is reproduced), 
there is another effect which tends to compensate 
for it in many cases. This is the reason, perhaps, 
that the measured A’s are not all higher than 120, 
as one may expect from Herzfeld’s paper. 

It has been emphasized by Becker and 
Brattain!’ that the quantities measured in case 
of variable work function (and thus contained in 
Table I) are W* and A*, defined by 

W*= RT?(d/dT) In i/T?, 


In A*=In i/T2+T(d/dT) In i/T? (2) 
=!n i/T?+ W*/RT, 


where z is the thermionic current. 

The procedure to obtain theoretical expressions 
for W* and A* is to calculate first 7 and obtain 
then W* and A* by (2). 


3 


In order to calculate the current, one may 
first calculate the vapor pressure P of the 
electron gas at a point with zero electrostatic 


®P. W. Bridgman, Phys. Rev. 31, 90 (1928); K. F. 
Herzfeld, reference 4. 
10 Reference 4. 
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potential by thermodynamics. If P is given, 
the number of electrons striking the surface can 
be calculated by elementary kinetic theory (m is 
the electronic mass) to be P/(2xmkT)*. If r 
is the reflection coefficient, this gives a current 7 
per cm? of the metal. 


i= Pell-r)/(ArmkT)). (3) 


The current from the metal is equal to this in 
equilibrium and is supposed to be equal also 
under the conditions of thermionic emission. 
Thus the problem reduces to the calculation of 
the vapor pressure P. 

Both the vapor pressure and the work func- 
tion! depend, in addition to the temperature T, 
on the (mechanical) pressure p, under which 
the metal is kept. We shall assume that the 
metal carries always only an infinitesimally small 
charge only. If we allow « electrons per mole of 
metal to escape and keep the volume of the 
metal constant, the mechanical pressure will 
increase by ep.(v, T). The energy necessary for 
the removal of e electrons, when the metal is 
kept at constant volume, is «W,(v, T). It seems 
to be simplest to apply the Clausius Clapeyron 
cycle by keeping the metal at constant volume. 
We then have” 


ð In P(v, T)/aT = W.(v, T)/RT?+5/2T, (4) 
which gives 
In P(v, T) di °) 4 5/2)1 T+j 
n (v, r RT ) n J 
T dr 
+ J [W.0, 1) — Wolo, J 
(5) 


j= (3/2) In (2amk/h?)+1n 2. 


The well-known Sackur-Tetrode value of the 
chemical constant must be increased by In 2 
because of the spin. In order to determine p., 
we can apply the usual cycle to a piece of metal 


u For the definition of the work function see the pre- 
ceding section. The vapor pressure P is a function of the 
mechanical pressure p for ordinary vaporization also, but 
this effect is rarely considered. . 

12 In this section, temperature T and volume v will be the 
independent variables. Thus temperature derivatives are 
always to be taken at constant volume, volume derivatives 
at constant temperature. 
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which is deprived of e electrons permole. The 
pressure is p+ep., the energy E+eW,, so that 
we have 


ðE OW, 
—+t+e—+p+e.= 
ov 


ov 


ð 
T— Epe), 
m eg be), (6) 


which gives in addition to the usual 


dE/dv+p=Tdp/dT (7) 
valid for any substance, the required 
aW../dv+p.e=Tdp./dT (8) 
since (6) holds for every e. From (8) follows 
dW ,(v, 0) 
= nn 
TrəW,(v, t) 9W.(v, 0) dr 
Ee 


n (4*/A(1—r))= 


ð ð 1 
W*= RT? tna) In — 
| oT 


—W,(v, 0)/RT+W,(v, T)/RT 
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We shall now calculate W* and A* by (5), (3) 
and (2). We have to remember, for this, that 
when the thermionic current is measured at 
different temperatures, not the volume, but the 
mechanical pressure of the metal is kept at 
constant value, namely zero. The temperature 
derivatives in (2) are meant to be taken at 
constant pressure, therefore. Hence, if a is the 
volume expansion coefficient, v9 the molal vol- 
ume, we have 


ðv T? 
ö ð 
= RT na) In (10) 
oT ðv T»? 
or W*=W,(v, T)+vaTp. (11) 


as the comparison with (9) shows. For A*, 
we have 


+ f CW.(v, We, 0) (dr/Rr?)+veep./R. (12) 
0 


Here A is the constant for the ideal case, 120 
amp. cm? °K2. The last term in (12) is that of 
Herzfeld, the other ones have been omitted by 
him, since the work function chiefly depends on 
the volume. Its contribution to A* is not neg- 
ligible, however. In general, p. is negative, the 
pressure decreases if an electron is removed. 


4 


For the evaluation of (12) we shall make 
assumptions very similar to those of Herzfeld. 
The total energy of the metal contains two parts. 
The first, V(v) arises from the motion of the 
electrons, depending, therefore, on the volume 
only.!® The second arises from the motion of the 
nuclei. The first part is the most important for 
the actual value of the work function, its change 
by the removal of electrons from the metal gives 
practically all of this quantity. For the second 
part, an expression RTD(®/T) will be used. In 
this, © is a function of the volume and will be 
changed also by the removal of electrons by the 
amount e®, if e electrons per mole are removed. 


13 We shall neglect the small specific heat of free electrons 


(cf. A. L. Reimann, reference 1). 


The total energy of a mole of the metal, out of 
which e electrons have been removed and brought 
to a point near the surface but still sufficiently 
distant from it in order to make the image 
force negligible, is thus 


E(v, T) +eW,(v, T) = Viv) +eW.(v, 0) 
+RTD(0/T)+eRO.D'(0/T). (13) 

These assumptions can be justified on the basis 

of the usual assumption of fast electronic and 


slow nuclear motion," but I shall not enlarge 
upon this subject. From (13) one readily obtains 


W,(v, T)=W,(v, 0) +RO.D'(O/T), (14) 
which pai with (12) 
dr vape 
2 ~=p(2)+e.f fo =)+ 
"A (1—7) R 


= (0./ aa T)—(0./ ene/r T) +voap./R, 
In (A*/A(1—1)) = —C,9./RO+v00p./R 
= —30,./O+veap./R, (15) 


14 Cf., 


e.g., H. Pelzer an E. Wigner, Zeits. f. physik. 
Chemie B15, 445 (1932); F. London, Zeits. f. Physik 74, 
143 (1932). 
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CONSTANT A IN RICHARDSON'’S EQUATION 


where C,=3R is the specific heat of the metal 
at constant volume, it has assumed its classical 
value at the temperatures under consideration. 
The thermionic emission can be considered as 
a generalized dissociation reaction, and cal- 
culated, hence, according to the transition state 
method for calculating reaction rates.’ The 
result is identical with the one obtained here, 
but I preferred this method because it is the 
more usual one in these problems. As has 
been remarked by Herzfeld, (15) corresponds to 
the expression for the chemical constant of the 
electron gas at high temperatures, while the 
familiar 120 amp./°K? cm? is the low tempera- 
ture expression. This is quite similar to the 
situation for Hoe, e.g.: at low temperatures the 
chemical constant is (3/2) In (2rmk$!?/h?) while 
at high temperatures there is an additional 
In 8m?kJ/h?. In contrast to He, the change of 
the “chemical constant” in our case is due to a 
change in the condensed state. Also the tempera- 
ture dependence does change in H; (from T*? to 
T?) while there is no such change in our case. 


5 


Next we shall try to estimate the quantities ©, 
and p, occurring in (15). It must be remarked at 
the outset that these two quantities are of a 
rather different nature: ©, is a function of the 
volume only, but p.e depends, according to (9), 
on the temperature and can be reduced to the 
more basic quantities, dW,(v, 0)/dv, ©. and 
dQ,/dv by (14). One obtains thus 


In (A*/A(1—1r)) = —C,0./RO 
— (voæ/R)ðW (v, 0)/ðv 


—waT(d/öv)(C,9./RO) (16) 


since the integral in (9) ıs just the v derivative 
of the expression we have calculated in the 
previous section. 


18 For the history of this method cf. E. Evans and M. 
Polanyi, Trans. Faraday Soc. 31, 875 (1935). The formula 


i=ATiev!RT (*) 


used by J. A. Becker and W. H. Brattain (reference 4) has 
been derived in the papers to which they take reference only 
for the case when the work function is independent of 
temperature. Of course, it is always possible to represent 
the thermocurrent by a formula (*) with variable w, but 
this w will not be equal to the work function. It follows 
furthermore from the third law that w can have no term 
linear in T at low temperatures (cf. reference 9). 
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It would be rather difficult to estimate the last 
expression in (16) since it contains d@,/dv. If 
one assumes however, that v(d/dv)(C,@./RO) is 
of the order of magnitude of C,9./R® itself— 
which seems reasonable, the last term can be 
neglected altogether, because «7 =10"°. Since 
W.(v, 0) is the energy necessary at the absolute 
zero, to remove one electron, dW,/(v, 0)/dv is the 
negative increase in pressure, due to the removal 
of one electron. Since the electrons do the binding 
in the metal, the removal of one will loosen the 
binding and the actual pressure decrease will not 
be quite as great as calculated on the free electron 
hypothesis. For alkalis, one can obtain an esti- 
mate of this magnitude from the formulas, 
derived for the calculation of the work function.!® 
Our W,(v, 0) is in the present approximation 
what is denoted there by £. Eq. (7) reference 16 
reads in our notation: 

— W,(v, 0)=13.5(E0+5:2.21/3r.?+1.18/r, 
—0.58/(r,+5.1) —0.19r,/(r,+5.1)?—eD). (17) 


The work function is expressed here in volts, r, 
in Bohr’s units (0.528-10-® cm) and is defined by 
Arr3/3=vw/L=atomic volume. D is the mo- 
mentum of the double layer on the surface and 
will be neglected hereafter. From (17) we obtain 


v00W ,/9v= (r,/3)9W ,/Ar.= 
—4.5(ndEo/dr,—7.4/r2—1.18/r. 
+0.587,/(r.+5.1)2+0.387.2/(n.+5.1)?). 


In order to eliminate dEo/dr,, we set the deriva- 
tive of (8) reference 16 equal to zero. This 
expresses that the total energy is a minimum 
for r.. 


—r,dEo/dr,+4.42/r,?-+0.28/r, 


(18) 


—0.587,/(r,+5.1)?=0. (19) 
This gives 
vð W,(v, 0)/dv 
=4.5(3/r2+0.9/r,—0.38r,2/(r.+5.1)?), (20) 


which is for r,=4 about 1.5 volts. The second 
term in (16) thus becomes with a«=21.10° 
about — 3.6. 


16 J, Bardeen and E. Wigner, Phys. Rev. 48, 84 (1935). 
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The quantity ©./© is the percent change 
of the vibration frequencies for one percent 
change in the number of electrons. It is, there- 
fore, half the percent change of the restoring 
force F, if one percent of the electrons is removed. 

In order to obtain an estimate for this, we 
may consider first the whole restoring force if 
only one ion is displaced. This is the force which 
acts on the displaced ion and originates from the 
electron cloud which is itself distorted by the dis- 
placement of the ion. 

It is difficult, in general, to calculate the dis- 
tortion of the electron clouds. Naturally, the 
inner electrons can be considered to be rigidly 
attached to the ion. We may assume that the 
valence electrons have an even distribution 
throughout the lattice and show wriggles in the 
neighborhood of the ion only. Then, the total 
charge distribution of the valence electron can be 
considered to contain two parts: pi, the varia- 
tions of which are appreciable only in distances 
comparable with the lattice constant, and pe 
which contains the small wriggles only. It seems 
reasonable to assume that the ion carries the 
wriggles with itself but as long as only one ion 
is displaced, the slowly varying part of the charge 
will not be much affected. If the ion was displaced 
by x in the X direction, it will be acted upon by 
a force 


Fx =xZedE,/ 0x, (21) 


where E is the field due to the slowly varying 
part pı of the electronic charge and Ze the ionic 
charge. We have, by Poisson’s equation div E 
=4rp, and for a cubic lattice, this gives because 
of 0E,/dx=d0E,/dy=dE,/dz 


F=4rpZe/3. (22) 


The percent change of this,!” for a one percent 
decrease of the number of electrons is simply 
—pi:/pim where py, is the slowly varying part of 
the density of the highest energy electrons, at 
the point where the ion is; pım is the mean value 
of the same quantity for all valence electrons. 
The first term of (16) is therefore, approximately 


(3/2) (pre/pim)- (23) 


17Qne can calculate, from (22), the characteristic 
temperature of the metal, and it comes out in the right 
order of magnitude. A more satisfactory calculation of the 
vibrational frequencies of a metal has been given lately 
by K. Fuchs, Proc. Roy. Soc. A153, 622 (1936). 
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This will be comparatively large (~1.5), if the 
last valence electron is an s electron, small 
otherwise. If the highest energy electron happens 
to be such an electron in the lattice, for which 
px: is zero, the whole first term will be negative 
also, because of the approximations we made. 


6 


It is hardly necessary to mention that while 
the considerations of sections 2 and 3 are, (12) in- 
clusive, strictly based on thermodynamics and 
those of 4 also should be correct within a very 
small error, the considerations of section 5 are 
very crude and will not give, in general, more 
than the sign of the effects and their order of 
magnitude. What can be claimed safely, is only 


(a) that it is purely accidental if the constant 
A* in Richardson’s equation (defined by (2)) 
has the value 120 amp./°K? cm? sec. 

(b) that the deviation in the In of A* should 
be of the order of magnitude 1-3, rather negative 
than positive. It should be negative always when 
the last valence electron is not essentially an s 
electron in the lattice. 


It should be emphasized once more that rather 
large deviations in the constant A* can be caused 
by a patchy character of the surface, as con- 
sidered by Becker and Rojansky.? These patches 
can consist in surface contamination or also in 
the polycrystalline character of the surface, 
since different crystal planes will have different 
work functions. It would be desirable, from this 
point of view, to measure the work function and 
A* on definite crystal planes, i.e., to use single 
crystals for the experiments.?® 

In some cases, zero energy outside of the metal 
may correspond to a forbidden region inside. 
This would have as consequence an unusually 
large reflection coefficient.!? The effect of a 
reflection coefficient has also been omitted in 
this paper. | 

It is a pleasure to express my gratitude to 
Professor Herzfeld for his valuable discussion and 
criticism of this paper. 


18 Professor H. B. Wahlin has pointed out to me that 
part of the effect of.the continued heating and flashing of 
the samples is perhaps due to recrystallization. Cf. H. B. 
Wahlin and J. A. Reynolds, Phys. Rev. 48, 751 (1935). 

19 P, M. Morse, Phvs. Rev. 35, 1310 (1930). 
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1. 


The very simplest form of the theory of the energy bands in metals? 
gave for many problems such accurate explanations of often very in- 
tricate properties of metals and alloys that it may well appear super- 
fluous to consider extensions of the simple form of the theory. Never- 
theless I believe that if the theory of metals is to broaden so as to include 
an even greater variety of phenomena, however satisfying the simple 
picture is for some purposes, the development of a more rigorous theory 
would not be superfluous. 

The above mentioned “ simple” theory considers the metallic elec- 
trons as moving (without interacting among themselves) in a pre-existing 
field with the symmetry given by the space group of the crystal. This 
picture fails in two respects: | 

(1) The field is not preformed, but owes its origin partly to just those 
electrons whose motion we are going to consider. To meet this objection, 
one must base the considerations on the Hartree-Fock equations.? 
This will have the advantage of allowing a determination of the field in 
which the electrons move, whereas the properties of the pre-existing 
field of the simple theory had to be taken entirely from experiment. 
Surprisingly, enough, in many cases, the field has such a form that plane 
waves form a reasonably accurate solution for the outer electrons. In 
these cases, the only major difference between this stage of the theory 
and the simplest form lies in the energy values which the two pictures 
attribute to the same plane waves. This difference of energy values 
originates in the exchange energy, characteristic for Fock’s equations. 
This Hartree-Fock stage of the theory has been worked out for a great 
number of phenomena, and in many cases the results differ significantly 
from those of the simple theory. 

(2) The simple theory assumes that the electrons move independently 
of each other.4 This is the approximation characteristic for the Hartree- 


1 Cf. e.g., N. F. Mott and H. Jones, The Theory of the Properties of Metals and 
Alloys. Oxford, 1936. Chapters II, III, VI, VII and especially Chapter V. 
Also several chapters in H. Fröhlich, Elektronentheorie der Metalle, Berlin, 1936. 

* The term Hartree-Fock picture will be used for the approximation in which 
the wave function can be written as a single determinant of single particle wave 
functions containing the spin. Such a wave function has, in general, neither 
any definite spacial symmetry, nor any single multiplicity. i f 

? F. Bloch, Z. Physik, 1929, 57, 545, made the first calculation on this basis. 
Cf. also L. Brillouin, J. Physique, 1932, 3, 565. It was developed subsequently 
for many problems by J. Bardeen, Physic. Rev., 1936, 49, 653; 1936, 50, 1098 ; 
1937, 52, 688. Perhaps the most important effect of the exchange energy which 
has not yet been treated thoroughly is the increase of the energy gap between 
occupied and unoccupied Brillouin zones. 

* One should say, more accurately, that in every stationary state the electrons 
show only those statistical relations which are required by the antisymmetry of 
the whole wave function. However, if one compares different stationary states, 
the energy values for one electron will be different for different states of the others. 
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Fock model. Unfortunately, it is difficult to evade this approximation 
and but few attempts have been made in this direction. The old Heitler- 
London scheme does not use this assumption; it makes approximations, 
however, which in the case of the conduction electrons in metals, are 
probably even more objectionable. 

The first question which immediately arises is: Why is it that the 
very simplest form of the theory gives such satisfactory results for 
many—indeed most—questions? I shall put forward the view that 
although the two above-mentioned improvements both increase the 
absolute value of the binding energy (‘‘ correlation energy ''), their effects 
on the energy differences between different states of the metal tend to 
compensate each other. I shall try to show this in two cases, 


2. 


In the simplest form of the theory, the average energy of the 
electrons if half of them have spins in one direction, the other half in 
the other direction (non-ferromagnetic state) is :— 


"ls g'h k3 n'h 2:21R 
€, = Eo + 3 10m vn = Eo + r 4 a 2 (1a) 


Flere E, is the average potential in the lattice, the second term is the 
average zero point energy, V and rn total volume and total number of 
electrons, r, is defined by (47/3) (7,4))*= V/n with a, the radius of 
Bohr’s first orbit in H; Ry denotes the ionisation energy of H, and 
& Planck’s constant divided by 2m. The energy for the state in which 
all spins are parallel is 


la = Ep 4 27/8 . (2a) 
According to these well-known formulæ, the more stable state would 
be always non-magnetic; the ferromagnetism could not be due to free 
electrons. The energy difference between the non-magnetic and mag- 
netic state is given in Fig. 1 by the line marked (a). 

The Hartree-Fock picture changes this situation radically. Bloch ® 
has made the calculation for the case where the modulations of potential 
do not affect the energy differences between electrons with different 
values of the momentum. This would certainly be the case if one could 
replace the positive ions by a uniformly distributed space charge. It 
is probably very nearly true,” however, in the alkalis also. The only 
change from the simple picture occurs then through the appearance of 
the exchange energy, as the ordinary interaction is supposed to be 
included in EZ, One has 


22ıR sıR 
rer. 
8 8 


2'21 Ry 0916 R 
= Ey 200200. (1b) 
for the non-magnetic and 
em = Ey + 2th Eg Ry 4, O96 Ry . (2b) 


Ts 
for the magnetic state. The energy difference between the non-magnetic 
and the magnetic state is plotted in Fig. I (line (b)). It becomes negative 


š Cf. e.g., A. Sommerfeld and H. Bethe, Handbuch der Physik, Vol. 2412. 


Berlin, 1933. 
* Cf. cot 3. 7 Cf. Mott and Jones,! p. 315, and references given there. 


Electron Interaction on the Energy Levels of Electrons in Metals 433 


680 ENERGY LEVELS OF ELECTRONS IN METALS 


for r, > 5:47 and elements with r, > 5:47 (i.e., Cs) should be ferro- 
magnetic, even if the effect of the modulations of the potential on the 
electronic energy can be neglected. 

As pointed out before, it is somewhat difficult to go beyond the 
Hartree-Fock picture. However, with the above assumption, a calcula- 
tion has been performed ® which adds a term to (2a) 


cg = Ey 221 RY O96 Ry, o58Ry ag 
r, r, 1, +51 

while it leaves (25) practically unchanged. Fig. ı shows that the differ- 
ence between (Ic) and (2b) practically coincides with the difference 
between (1a) and (2a), again leading to the result that free electrons 
(.e., electrons for the motion of which the lattice potential is of no 
material importance) do not 
lead to ferromagnetism. This 
accords well with recent ideas 
on ferromagnetism °? according 
to which ferromagnetism is due 
to the d-electrons which are 
rather tightly bound to the 
lattice. It must be admitted, 
however, that Fig. I gives an 
exaggerated picture of the 
agreement between the simple 
form and the accurate theory. 
5 % The additional term in (Ic) is 
Fic. 1.— Energy difference between the ferro- only app roximately correct and 
i etic and the non-magnetic states. for higher r, a “correlation 
Line (a) for the simple picture, line (b) term’ should be added to (2b) 


for e REN pee 0 also. An estimate of the real 
taki accoun e correlation : 

energy for the non-magnetic state, line energy difference between the 
(e) estimated correct. non-magnetic state and the 


ferro-magnetic state is given 
in Fig. 1 (curve e). We shall see later that for very large r, em — En 
goes to zero exponentially. Thus the inclusion of the correlation energy 
will cancel only a part (though probably the major part) of the effect of 
the exchange energy. It must further be emphasised that one should 
consider also states with intermediate momenta.?° 


3. 


The next question is the density of energy levels. The magnitude 
of this determines a great many effects, such as paramagnetism,” specific 
heat at low temperatures; 2% moreover, conductivity and optical pro- 
perties are crucially dependent on it.!? If the effect of the zones is 
negligible, the difference between the energies of the states with the 
wave numbers k and &,, is given by 


h3 | 
e(k) = z (K — ka) ee 3) 


°E. Wigner, Physic. Rev., 1934, 46, 1002. 

* Cf. J. H. Van Vleck, ibid., 1937, 53, 1178, and further literature given there. 
105, Schubin and S. Wonsowsky, Proc. Roy. Soc., A, 1934, 148, 159. 

n H, Fröhlich,! $ 27. E. C. Stoner, Proc. Roy. Soc., A, 1936, 154, 656. 

18 Cf, +8 Mott and Jones,! p. 178. 

13 Cf. Mott and Jones,? Chapters VII and ITI. 
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where ka will denote the wave. number of the highest state occupied at 
the absolute zero temperature. From (3) the specific heat and hence 
the logarithm of the level density of the whole metal can be calculated 
by the methods of Bethe and Bardeen. The latter becomes 


In p (€) = 1-62nr , (e/nRy)t ; . (4) 


Taking into account the exchange energy, Dirac 15 obtains instead 
of (3) 


h Mk Jk kp | 
e(k) = F ka) + e—a BE - (5) 


This equation gives (for not too high k) a stronger dependence of e on 
k, than (3), and hence a much lower level density. For k = kwm, the 
derivative of e with respect to k even becomes infinite. It must be 
remembered, however, that it was assumed in the derivation of (5) that 
the electrons fill up all the levels with smaller wave numbers than kp, 
except for the electron under consideration, the wave number of which 
is k.. Since at the temperature T, the top of the Fermi distribution is 
fuzzed out to a distance of about AT in the energy scale, (5) will be ex- 
pected to fail if e is not large compared with KT. 

For e ~ kT, one must take into account the fact that some levels 
with a larger wave number than k, are occupied, while others with a 
smaller wave number are unoccupied. Denoting by s(k) the probability 
that the level k is occupied, we have at low temperatures ($ = 1/kT). 


I 
(k) = awy + - +: 


On the other hand, the energy e(k) is no longer given by (5) but by 
h? I n fe 
c(t) + elku) = Ek — J = v(k‘) | ZRH  . (62) 


where the last term contains the exchange integrals of the wave function 
with wave number k with all occupied states. Equations (6) and (6a) 
give a “self-consistent ° occupation probability v(k) and energy e(k). 
Both v(k) and e(k) depend on the temperature, the latter because the 
electron & is under the influence of the most probable distribution of the 
electrons, which depends on the temperature. Bardeen,!* to whom this 
consideration is due, has solved (6) and (6a) approximately and obtained 
also the level density for k = km. This decreased logarithmically with 
decreasing temperature, t.e. with decreasing total energy of the system. 
The temperature dependent part of the total energy is affected also: it 
is smaller than Sommerfeld’s value by the factor 


e'm 2e? k m 
I + Thik, In Zr =] + 2°157, — (r,/6) In Tr, š : (7) 


Equation (7) is not exact but is sufficiently accurate for temperatures 
between I and 100 degrees absolute. It bears on the comparison of 
observed and calculated electronic specific heats, given by Jones and 
Mott.7 The disagreement between calculated and observed values 


u H. A. Bethe, Physic. Rev., 1936, 50, 332; J. Bardeen, ibid., 1937, 51, 799; 
H. A. Bethe, Rev. Mod. Physics, 1937. 9, 69, § 53. There is a misprint in equ. 
(284), the exponent of U*+4/a should be 1/(2). 

18 P, A. M. Dirac, Proc. Camb. Phil. Soc., 1930, 26, 376; L. Brillouin, ref. 3, 
also J. Bardeen, Physic. Rev., 1936, 49, 653. 

1¢ J. Bardeen, ibid., 1936, 50, 1098, and unpublished work. 

1? H. Jones and N. F. Mott, Proc. Roy. Soc., A, 1937, 162, 49. 
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becomes several times greater and the excellent agreement for Cu seems 
to be lost. However, as mentioned in the first section, the correlations 
neglected in Hartree’s picture tend to increase the level density and the 
specific heat again and thus restore the agreement between calculated 
and observed values. They will be considered next. 


4. 


We shall now investigate the way in which an improvement of the 
wave functions beyond the Hartree-Fock scheme will affect the density 
of energy levels. I can see but two ways, both involving very rough 
approximations, of obtaining information on this question. 

(1) Apply the method used previously for the lowest state of the 
metal ® to excited states. The wave function obtained in this way is 
certainly only very approximate. In particular, even for the ground 
state (for which the solution of the Hartree-Fock equation belongs to 
a single multiplet system (viz., the singlet system) the improved wave 
function is a linear combination of wave functions for several multiplet 
systems (all of which are low, however). This is true also for the excited 
states. On the other hand, the wave function behaves qualitatively in 
the expected way, j.e., the electrons with antiparallel spins try to keep 
apart also. . This method of calculation is quite laborious, however, and 
I shall not go into it here. 

(2) The improvement of the wave function is made in two steps: 
(a) Build wave functions out of the Hartree-Fock solutions,}§ which 
have the proper symmetry, i.e. belong to a definite multiplet system. 
(b) Combine many wave functions with the same symmetry to obtain 
a better wave function. Clearly the first step is much easier than the 
second. It may be remarked that the first step is unnecessary for the 
lowest state; here the wave function has already a singlet character. 
Nevertheless, the second step gives a very great improvement in the 
energy. | 

Let us consider, first, a state 1° in which 2s orbits, all close to the top 
of the Fermi distribution, are singly occupied. We have, on the whole, 
2% wave functions. From these, at least all with an equal number of 

spins in both directions (total Z component of spin zero) were given the 


R ; f 2s . 
same energy in the previous calculation. Of these (5) functions 


Gee we teren . . (8) 


have the multiplicity S. Those with a higher multiplicity will be pulled 
in general lower, since their energy must be equal to the energy of a state 
in which $2 + S spins are directed one way, 4n — S the other way. 
For these partially magnetic states, a correction such as that in (2b) 
must be introduced, which amounts to — 0-916. 85?/(onr,). Thus 


the (z) states which had the same energy in the Hartree-Fock approxi- 


mation spread out. This will doubtless increase the density of levels 
everywhere, since the Hartree-Fock density is an extremely rapidly 


28 T.e., out of Slater determinants of plane waves. 

1° Similar considerations have been carried out for nuclei by J. Bardeen and 
E. Feenberg. I am much indebted to these authors for a discussion of their 
results. 
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increasing function and every energy value will gain more from its higher 
energy neighbour than it loses. However, the average spread is only 


8 0-016 [sis + He=#dS _80916 s 
o9 nr, [es + tye sl dS 9 r, n 


which is essentially independent of the number of particles. Thus, for 
macroscopic bodies, the first step in the improvement of the wave function 
does not increase the level density sufficiently to change the specific 
heat. This is because we have assumed a spreading only as required 
by the symmetry of electron exchange, which materially affects states 
with large S only. These however are scarce. 

If we improve our wave functions further by forming linear com- 
binations of wave functions with the same symmetry, we shall expect 
a spreading of levels of all multiplicities. Especially, the interaction 
between levels of different configurations now becomes important—the 
more important, the larger the crystal. At present, there is no well- 
developed method for taking all these interactions into account. Their 
effect can, however, be readily visualised in the case of a very strongly 
expanded lattice, and may be expected to be in the same direction in the 
case of ordinary lattice constants. 

Of course, if one could really expand a lattice, the band structure of 
the electronic states would become more and more pronounced and our 
original assumption become quite unjustified. For an orientation about 
the problem in hand it is equally useful, however, to replace the positive 
ions by a uniform space charge first (which, in many cases, does not alter 
matters appreciably) and then to expand this space charge. Thus our 
expanded lattice will not be a lattice at all and will in many ways have 
very little similarity to an actual metal. On the other hand, it can 
readily be treated mathematically and it may also be expected that the 
effect of the above-mentioned improvement of the wave functions will 
have a similar effect on the level density in this model as in actual crystals. 

For a very large lattice constant, the kinetic energy of the electrons 
will play only a secondary réle. As a first approximation, the electrons 
will assume definite positions such that their potential energy will be 
aminimum. This is the case for a regular lattice arrangement, probably 
the body-centered lattice. As a second approximation, the electrons 
will move a little around their equilibrium positions. 

We are thus led to consider a model corresponding to an “ inverted 
alkali metal.” The positive charges are uniformly distributed (because 
we know that the effect of the ions can be replaced in alkalis by such a 
charge distribution), while the negative electrons are localised because of 
their repulsive action. For the potential on one electron, only the effect 
of the positive space charge of the s-sphere surrounding this electron 
has to be taken into account, the effect of the rest of positive space 
charges being very nearly cancelled by the other electrons.™ The 
electron thus moves in the field 


(— 3/r, +F jr”) Ry + . . (10) 
(r denotes the distance from the centre of the sphere in Bohr units), :.e., 


- (9) 


2° This calculation assumes S < s << m, i.s., that we have no ferromagnetism, 
The situation is the same, however, for the ferromagnetic state also. 

sı K, Fuchs, Proc. Rey. Soc., A, 1935, 181, 585. 

n E, Wigner and F. Seitz, Physic. Rev., 1934, 46, 509: K. Fuchs.3? 


Electron Interaction on the Energy Levels of Electrons in Metals 437 


684 ENERGY LEVELS OF ELECTRONS IN METALS 


it forms an oscillator with the frequency Ry/rhr ls. The energy in the 
lowest state is 


(—3/r,+ 3/re")Ry . . (11) 
while the wave function becomes 
y = (ajr) henh; a = ijrèh.. . (12) 


Of course, this picture is only consistent if the wave function (12) has 
practically dropped to zero at the surface of the s-sphere, i.e., if 
art=r,h>2. A somewhat more detailed investigation of the con- 
ditions of validity of our picture shows that it must be quite accurate 
for r, ~ 10 and that even for r, ~ 6 it is probably more nearly true than 
we are used to think. This is partly borne out by the magnitude of 
the correlation energy, and means that the electron gas under this 
density is already rather far from the ideal condition and more nearly 
a liquid with high compressibility than a gas. 

In the treatment just given we have neglected the effect of the dis- 
placement of one electron on the field acting in another s-sphere. This 
treatment corresponds to Einstein’s treatment of the solid body, while 
the cross terms between displacements are taken into account only in 
Debye’s, Born-Karman’s, and Blackman’s theories. This would yield 
a somewhat lower ?* value for the zero-point energy, and a somewhat 
different expression for the wave function, since in (12) the normal 
co-ordinates would appear in the exponent, with coefficients « propor- 
tional to the corresponding frequencies. However, the frequency spec- 
trum of our model is rather different from the frequency spectrum of 
ordinary solids. 

It may be interesting to note that for r, ~ 6 (II) gives a value for 
the correlation energy which is one-half of that of the previous calcula- 
tion ?5 although the wave function (12) is certainly still very inaccurate 
for this radius. 

Of course, the foregoing considerations are valid only for very large 
r,, especially only if the % of (12) is already practically zero for r = 7,. 
However, the (positive) gnergy change necessary to accomplish this 
boundary correction goes exponentially to zero with 7,/1. Apart from 
such a correction we have in (12) in fact n | solutions of Schrödinger’s 
equation, as every electron can be allocated to every lattice point. On 
account of Pauli's principle, this means that we have, introducing the 
spin, 2" wave functions with approximately the same energy, giving 


(ns) ac -S4 ı) 


terms of multiplicity S. This shows that for large r,, the energy dif- 
ference between the ferromagnetic and non-magnetic states decreases 


»® The electron gas differs rather curiously from an ordinary gas in so far as 
in the latter the potential between the atoms can be neglected if the density is 
small. In the electron gas, the greater the density, the more perfect the degen- 
eracy and the behaviour becomes ideal for high densities. 

34 One can take this into account by a method due to F. London, Z. Physik, 
1930, 63, 245, which gives 2-7 instead of the second 3 in (rr). 

** The correlation energy is obtained from (11) by subtracting it from the 
energy, calculated for the same density, by the Hartree-Fock model. The latter 
contains three parts: the electrostatic energy (— 1-2/r,), the zero point energy 
2-21/r,*, and the exchange energy (— o-916/r,). On account of the somewhat 
different picture used here, the low density limit of the correlation energy is 
somewhat different from that obtained in ref. 8. For 7, = 6 and r, = 8, (11) 
gives 0-15/r, and o-21/r,, while the estimates of ref. 8 are 0°32/r, and 0-36/r,. 
For these radii, equation (r1) cannot yet be expected to be valid. 
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much more rapidly than appears from (25), (1c) demonstrating the exist- 
ence of a substantial correlation energy for the ferromagnetic state also. 

One can calculate from the frequency of the electron vibrations, 
using Debye’s theory, their specific heat, and from this the density of 
energy levels, following Bethe's and Bardeen’s procedure. One obtains 
for the logarithm of the density 


In p(e) ~ n{In2 + 2-27, (e/n Rybi) ; . (13) 
where n is the total number of electrons, and the term n In 2 arises from 
the above-mentioned 2" fold approximate degeneracy. Comparing (13) 
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Fic. 2.—Schematic picture of the behaviour of energy levels for changing r,- 
The density of the lines of the figure corresponds to the logarithm of the 
energy level density of the whole metal or to the density of one particle 
levels. Figure (a) for the simple picture, (b) for the Hartree-Fock picture, 
(c) for the picture with correlations. 


even with (4) shows clearly that, at any rate for large lattice constants, 
the inclusion of correlations increases the density of levels. The same 
can be expected for the real lattice constants also, and the effect of 
improving the wave functions beyond the Hartree-Fock scheme partly 
counterbalances the effect of the exchange energy. 

Fig. 2 shows diagrammatically the behaviour of 7,? times the energy, 
as a function of r, for the three models. Although, of course, the 
irregularities indicated in the third picture come in only at those values 
of r, for which there is no practical example, it would not be surprising 
to find some anomalies in the electronic specific heat of alkalis at low 
temperatures. However, the effect of the exchange must still be 
expected to predominate in these cases. 

I am very much indebted to the Wisconsin Alumni Research Founda- 
tion for the support which they have given my research. 
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1. Introduction. 


The subject on which I wish to speak has occupied me on and off for many 
years. It consists only of qualitative considerations and much of what follows will 
not be new to those who are actively working in the field. The occasion to present 
these ideas was chosen not only because of the kind invitation extended by the 
Science Council of Japan but also because some recent quantitative work has given 
some strength to the rather crude ideas which will be described below. I am re- 
ferring particularly to Wohlfarth’s and Fletcher’s work” on the structure of the 
Brillouin zones of the 3d shell and to the doctoral dissertation of F. Stern in 
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Princeton. 


Before starting, let me make a few comments on the purpose of this and similar 
work. If I had a great calculating machine, I would perhaps apply it to the 
Schrödinger equation of each metal and obtain its cohesive energy, its lattice 
constant, etc. It is not clear, however, that I would gain a great deal by this. 
Presumably, all the results would agree with the experimental values and not 
much would be learned from the calculation. What would be preferable, instead, 
would be a vivid picture of the behavior of the wave function, a simple descrip- 
tion of the essence of metallic cohesion and an understanding of the causes of 
its variation from element to element. Hence the task which is before us is not 
a purely scientific one; it is partly pedagogic. Nor can its solution be unique: 
the same wave function can be depicted in a variety of ways (just as a cubic 
close-packed lattice can), the same energy can be decomposed in a variety of ways 
into different basic constituents. Hence the value of any contribution to the pro- 
blem will depend on the taste of the reader. In fact, from the point of view of 
the present article, the principal purpose of accurate calculations is to assure us 
that nothing truly significant has been overlooked. 


Recent work, in particular Korringa’s® and the one which was just reported”, 
gave the theory of the cohesion of alkalis a simplicity and roundedness which will 
be difficult to surpass. More important, the latter one derives the properties of 
the alkali atoms, which are used in the calculation of the cohesive energy, directly 
from the observed spectra. One is tempted to ask, therefore, what inferences one 
can draw from the spectra of more complicated atoms on their cohesive properties. 
This, in a sense, is the thesis of my talk, but before broaching my subject proper, 
I wish to recall a few figures from the theory of the alkali metals. Let us take 
therefore a glance at the calculations. 


The cohesive energy is repesented as the sum of three quantities. The first I 
shall call boundary correction. It gives the difference between the binding energy 
of the valence electron of the free atom and the bottom of the Brillouin zone in 
the crystal. It is the direct result of the circumstance that the radial derivative 
of the crystal wave function vanishes at the cell boundary while it must be, at the 
same distance, negative in the atom. Hence its name. Its magnitude is about 72 
calories/mol in the case of sodium. The second part of the cohesive energy is the 
kinetic energy of the free electrons (Fermi energy). This amounts to about 43 
calories in Na. The last part are the contributions from the various “holes”: the 
circumstance that. the atomic electron is always in the field of a charged ion, 
i.e. in a field of positive potential, has its counterpart in the metal in the “holes ” 
of electron density which the electrons carry around themselves and which result 
from statistical correlation between the positions of the electrons. The net total 
of this last contribution to the cohesive energy is very small and it became 
customary to neglect it. Thus the cohesive energy of Na is 72—43 = 29 calories/mole. 
The ist Figure illustrates this situation. It shows, in particular, that the 
energy of the electron at the top of the Fermi distribution is just about equal to 
the energy of the electron in the free atom. Furthermore, if one assumes that 
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the energy is a quadratic function of the momentum throughout the whole Brillouin 
zone, the kinetic energy of the electrons would just compensate the boundary 
correction and no binding would result. These results, obtained in the first place 
for sodium, can be considerably generalised and form the basis of the second of 
the following observations. 


MW 


FREE ATOM Ä N u > 
Yi zu NE: z5 


Fig. 1. Comparison of the various quantities contributing to the cohesive energy 
(C.E.) of Na. The boundary correction (B.C.) determines the bottom of the 
Fermi zonė. The electrons at the top of the Fermi zone (the occupied part 
of the Brillouin zone) have just about the same energy as the electron in 
the atom. The average energy of the whole Brillouin zone would also 
be very nearly equal to the energy of the atomic electrons. Hence, no 
binding would result if the whole zone were full. 


2. Preliminary Observations. 


1. The multitude of allotropic transformations with very low energy of tran- 
sition shows that the cohesive energy is reasonably independent of the crystal 
structure. The following considerations could be based, therefore, on any one of 
the common crystal structures, and the body centered one was chosen because the 
Brillouin zones show the greatest symmetry for this structure. In most cases, I 


have convinced myself that the results would be quite similar for a face-centered 
lattice. | 


2. If one builds up the wave functions of a Brillouin zone by properly super- 
posing atomic wave functions centered at the different lattice points 


Pz = Deny (F—G,) (1) 


(a, are the different sites of atoms, Y the atomic wave function) the average energy 
of all the electrons of the Brillouin zone is greater than the energy of the electron 
in the free atom. This is what the right side of Figure 1 illustrates. It is also 
contained in Heitler and London’s valence theory’s statement that closed shells 
repell each other. It means, in our language, that the boundary correction is 
smaller in the approximation of (1) than the average Fermi energy of the zone and 
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it is not difficult to see this also directly. Actually, of course, the implication of 
this observation. is well borne out in the case of the noble gases; the origin of 
the cohesion of these elements cannot be described in the individual particle pic- 
ture. It is not borne out at all in the earth alkali elements all of which have in 
fact larger cohesive energy than the alkalis — a behavior opposite to the one 
which would be expected from Figure 1. It will be necessary, therefore, to modify 
the underlying picture rather drastically. 


This modification shall consist, from a mathematical point of view, in considering 
boundary correction and Fermi energy together as a boundary correction. This 
has been done, of course, before, in particular lately by W. Kohn®. The principal 
point which I shall stress is, from the physical point of view, the effect of the 
excited states of the atom on the band structure of the metal. 


3. The third observation relates to the extent to which the effect of the ion’s 
charge on the atomic electron is compensated by the effect of the various “ holes ” 
in the metal. We have seen that this compensation is nearly perfect in one electron 
systems. To be sure, the absolute value of the potential, where it is particularly 
large, is larger in the atom. On the other hand, the electron carries along its 
particular potential hole wherever it travels in the metal and this hole, though 
more diffuse than the positive charge of the ion, is right there where it is needed. 


The compensation of the holes cannot be expected to prevail in the case of 
elements the structure of which is more complicated, particularly if the normal 
state’s configuration gives rise to levels of different multiplicities (such as 2). In 
these cases there is a “hole” around each electron already in the atom which 
follows it around. The correlation between the positions of electrons is weak if 
there are only two valence electrons but becomes stronger as the electron number 
increases*. It manifests itself also in the difference between the normal state of 


* There is no correlation, in the Hartree-Fock picture, between the distances of the various 
electrons from the nucleus. As long as the number of electrons does not exceed 2/+1, the 
average probability, for all states of highest multiplicity, for an angle & between the directions 
of any two electrons is proportional to [ (27+1) /2/][1—P, (cos v)2]. In this case, all the electron 
spins are parallel. If the number n of the electrons exceeds 2/+1, their spins cannot be parallel 
any more. In this case, the probability for an angle wv, averaged over all states of highest 
multiplicity, becomes the average of the above expression with the weight 3% (2/+1) (2) + 
34 (m—-21—1) (n—21-—2) and of 1 with the weight (2/+1) (a—2/-]). If the position of one electron 
is fixed, the distribution of the other electrons has a quadrupole moment in the direction fixed 
by the first electron. This does not correspond to a very close correlation. 

In particular, in the first case mentioned above, the probability vanishes for two electrons 
lying on the same straight line through the nucleus no matter whether they lie on the same 
or on opposite sides thereof. Correlations which go beyond those given by the Hartree-Fock 
picture will surely improve on this situation and furnish, in addition to the quadrupole moment 
mentioned above, a dipole moment in the distribution of the electrons around a given electron. 
However, the total effect of correlations in the atom will remain lower than it is in the metal 
because the correlations in the metal give not a quadrupole or dipole moment, but a net posi- 
tive charge around any givenelectron. This positive charge is being balanced, in the present 
method of bookkeeping, against the equally large net positive charge of the ion plus the effect 
of the quadrupole and other moments described above. (Note added December 1953) 
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the atom, and the energy calculated in the orthodox Hartree fashion; it is given 
by the Slater integral. This results in a more effective hole in the charge distri- 
bution around the atomic electron than around the metal electron. It has two 
effects. First, the binding due to the p and d electrons decreases as their number 
increases, on this account as well as on account of their increasing kinetic energy. 
Second, the picture based on the independent particle model with traveling (Bloch) 
waves becomes inaccurate toward the middle of the d and even the p shell. It is 
indeed very unlikely that, in the case of half filled shells, the electrons in the. metal 
should not be able to correlate their positions better than the free electron picture 
would indicate and that they could not approximate the correlations Present in 
the atom. I shall, indeed, show wave functions which do this. 


There is, of course, no effect of the type here considered in the s shell and the 
effect virtually disappears also for closed shells. The third observation is summar- 
ized in Table 1. 


Table 1. Hole Energies (cf. Appendix) 


en 


s —0.6e?/r; +0.458e?/r,+C’ = -0.142e2/r,+C’ 


s2? ~ 2 x 0.6e2?/r;, —C +2 x 0.458 x 22/7, +2C’ en 0.045e2/r,+2C’ -C 
we) } -0.6ne2/rs- In (n-1)/2+0.458n n !ae2/rs+]’n2/6-HnC’ 


— 0.28e?/7; + J/6 +C’ 

— 0.39e2/r, - J/3 + 2C” 
—0.42e?/rs —3J/2+ 3C” 
—0.40e2/r,—-J/3 
—0.25e?/rs+J/6 

= 0.046e2/ Ts 


a a a 


4. The last observation only records the fact that in very heavy atoms the 
magnetic interaction becomes comparable with the cohesive energy. This makes 
it desirable to restrict our considerations to the cases in which the magnetic in- 
teraction is not too large. 


3. The Structure and Width of p and d bands. 


The second of the above observations shows that, as long as the wave function 
can be assumed to have the form (1), the bands are centered at the value of the 
atomic level. The binding will be, in this approximation, largest for the first 
electron of each shell, will be about zero for the electron in the middle of the 
shell and will be negative for the electrons in the second half of the shell. This 
means that the cohesive energy should be largest at the middle of the shell and 
decrease toward both sides of it. 


Figure 2 gives the cohesive energies of all solids as function of the charge 
number Z. One sees that the behavior as described above obtains at least approx- 
imately in the 35 and 4p shells — the behavior of the cohesive energy as function of 
Z is indeed very similar in these shells. The situation seems somewhat more in- 
volved in the 3d and 4d shells which are also rather similar to each other. They 
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differ from the above expectations principally in giving a non negligible binding 
for the closed shells — Zn and Cd. The cohesion in solid carbon is too large to 
conform with the above picture and this is one of the facts for which I shall be 
unable to tender any explanation. It was mentioned before that the binding 
energy of the earth alkalis is larger than that of the alkalis which directly con- 
tradicts the above picture. Nevertheless, the degree of agreement makes it desirable 
to arrive at some more detailed picture for bands of electrons with wave functions 
of the form (1). 
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Fig. 22 The full line gives the cohesive energy of the condensed state in kcal as 
function of Z. The electronic shells are marked on the horizontal axis. The 
broken line gives one half of the first ionization potential for comparison. 


This can be done, of course, on the basis of detailed calculations and such 
calculations have been carried out repeatedly, most recently for the d shell by 
Wohlfarth and Fletcher”. Let me consider, therefore, the p band of a body centered 
lattice. The three wave functions have the same energy for k = 0, i.e. at the center 
of the Brillouin zone and Figure 3a shows the sign of the wave function at the 
boundaries of the cells in the (110) plane. One sees that on most boundaries the 
wave function originally has opposite signs on the two sides. This means that 
the wave function will actually vanish at these boundaries, i.e. the boundary cor- 
rection is positive in this case and the k=(0,0,0) point lies at the top rather than 
the bottom of the Brillouin zone. This illustrates, by the way, why it was desirable 
to consider Fermi energy and Boundary correction together: they have, at least 
in this case, little meaning separately. | 


Let us consider next a point at the surface of the Brillouin zone and choose, 
first, the point at which the three branches again coincide (Figure 3b). One sees 
that the fit is, in this case, by no means perfect (as it is for s electrons) but never- 
theless sufficiently good to give a very substantial negative boundary correction. 
This point therefore lies near the bottom of the band. 
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Fig. 3a. Section through a body centered lattice parallel to the (110) plane. The 
crosses at the centers of the hexagons give the positions of the nuclei. The 
hexagons are the intersections with the polyhedra surrounding the atoms. 
The + and — signs at the hexagons give the sign of a p wave function of 
an atom which is supposed to be at the center of the hexagon. In this 
Figure, the p wave functions of all the atoms have the same phase corre- 
sponding to a wave vector k= (0,0,0). Over most of the boundary, the sign 
of the atomic wave functions is opposite on the two sides of the boundary. 
This indicates poor fit and high energy of the k= (0, 0,0) state. The Figure 
gives only one p wave function. However, the situation is the same with 
the other two. 


Figs. 3b, 3b’. The arrangement is the same as in Figure 3a, except that the wave 
vector is k= (0,0, 27/a) where a is the lattice constant. It corresponds to 
a surface point of the Brillouin zone. For this &, the atomic wave functions 
of nearest neighbors have opposite sign at corresponding points. The three 
p functions give two energy values. Figure 3b gives the sign of the atomic 
wave functions at the boundary of the polyhedron for the wave function of 
one of these energy values, 3b’ for one of the wave functions of the other. 
The fit is rather good in both cases, indicating low energy values. 


Figure 3c shows another point at which three zones coincide — its boundary cor- 
rection is very smal] so that it must lie near the center of the band. Figure 3d, 
finally, illustrates the sign of the wave functions for a wave vector for which the 
three bands are separate. The fit is very poor for two of the wave functions in 
this case but is the best of all for one of them. Hence, the three energy surfaces 
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Figs. 3c, 3c’, Same as Figures 3b, 3b’ for the wave vector k= (m/a, x/a, w/a), another 
point on the surface of the Brillouin zone. The two atomic wave functions 
differ, over much of the boundary, by a factor % Hence the boundary 
correction is very small, the energy is close to the middle of the Brillouin 
zone. 


+ 


Figs. 3d, 3d’ Same as other Figures 3, for the wave vector k= (m/a, 2/a, 0), also a point 
on the surface of the Brillouin zone. The atomic wave functions fit together 
at the boundaries ofthe polyhedra very well in Fig. 3d. The energy which 
corresponds to the wave function represented is presumably the lowest in 
the zone. 


are widely separated for this wave vector (k = /a, z/a, 0), one of them forming, 
presumably, the bottom of the zone. 

The picture which we obtain for a p band which is well separated from all 
other bands and has wave functions of the form (1) is, therefore, as follows. There 
is no wave function with entirely perfect fit at the boundaries and the bottom of 
the band is, therefore, depressed less than in the s zone. The total width of the 
band is slightly smaller than that of an s band of similar energy and it contains 
three times more states than the latter. The density of states seems particularly 
large near the bottom of the band because the energy varies rather little between 
k = (0, 0,27/a) and k =(z/a,z/a,0) on one of the energy surfaces. 


The Brillouin zone due to d electrons — actually, there are two of them — can 
be discussed in the same way and the results are similar to those of Fletcher with- 
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Fig. 4. Schematic diagram giving the energy (in kcal/mole) as function of the wave 
number k in the ‘111) and (110) directions. The broken line gives the 
energy of a free electron, the full lines indicate how the existence of low 
lying excited states of the atom might influence the energy. The point Tı 
represents an s orbit, the points Py and N,’ p orbits, the points Tas’ and Ng 
represent d orbits. The wave functions for Py and N,’ are illustrated in 
Figures 3c, 3c’, and 3d. The p and d states increase the number of available 
states below about 100 Cal and thus decrease the Fermi energy. The line 
connecting F25’ and P, has a twofold degeneracy. The symbols F, P, etc. 
refer to the symmetry symbols given by Bouckaert, Smoluchowski and 
Wigner (Phys. Rev. 50 (1938), 58). 


out giving, of course, the quantitative results he obtained. The fit of the wave 
function which gives the lowest state is very good in this case. The total width 
of the two zones is, under the assumption of well separated zones, very nearly the 
same as in the case of an s zone. Naturally, the width of the zone will be much 
diminished if the lattice constant is kept larger than would correspond to the 
minimum of the energy of the d electrons alone. The electrons of the s shell 
which preceeds the d shell exert such an influence. 


Much of the qualitative behavior of the energy surfaces can be seen in the 
following way. Let us insert for the atomic wave funciion y in (1) the product 
of xy+xz+yz and a radial function. The y obtained in this way will be much 
larger in the (111) direction, i.e. in the direction of two of the nearest neighbors, 
than in the direction of any other neighbor. The bulk of the boundary correction 
wili arise, therefore, from the part of the surface which lies between the original 
atom and these two neighbors. The boundary correction will have the largest 
negative value if the glk, of (1) corresponding to the a, of these two neighbors, 
is equal to 1, i.e. if the wave vector k is perpendicular to the (1,1,1) direction. 
However, the correction will depend rather little on the component of k perpen- 
dicular to this direction so that one has a rather flat portion of the energy surface 
(the minimum apparently does not lie at k=(0, 0,0) as one might first suspect). 


One can further augment this lack of sensitivity of the energy on the com- 
ponents of k perpendicular to the direction of the maximum of k, by skipping, in 
(1), some of the lattice sites a» The resulting wave will not be a Bloch wave any 
more but, around certain k, the energy dependence on the: perpendicular 
component of k will be very slight indeed. One will be tempted, therefore, 
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to attribute the same spin direction to all these states thus recovering some 
of the correlations between electronic positions which lower the atomic energy 
and which are responsible for the J’s of Table 1. There will be, of course, other 
wave functions of the form (1) which contain those a, which were skipped in the 
wave function considered before. If the spin of the corresponding states have 
cpposite directions, no non-orthogonality corrections between the two types of wave 
functions will apply. It is conceivable that the state of the whole system based 
on such non-Bloch type wave functions is lower than the one based on Bloch type 
wave functions and one is also reminded faintly, of Zener’s theory of ferromag- 
netism”. It seems to me, however, that it would be premature to follow these 
ideas further at this time particularly since the variety of the non-Bloch type wave 
functions is so large. 


Returning to our principal line of discussion, it should be evident from what 
we know about the widths of Brillouin zones, in particular from Slater’s explanation 
of ferromagnetism, that the picture of well separated p and d zones is far from 
accurate. Rather, the p and d zones strongly overlap at least with the preceding 
szones. This causes them — particularly the d zones — to be much narrower because 
s electrons of energy equal to that of d electrons, have a much longer tail than 
the d electrons. The structure of the region between the s and the p and d zones 
might be similar to that given by Figure 4. This gives the energies of an s—p 
and s—d pair of zones with wave vectors along the (111) and (110) directions. 


It will be attempted now to interpret the cohesive energies, i.e. the contents of 
Figure 2, in terms of the above observations. 


4. Interpretation of cohesive energies. 


No attempt will be made here to reproduce the interpretation of the cohesion 
of alkalis beyond the remark that the decrease of the cohesive energy, in the Li- 
Na-K-Rb-Cs series, is principally due to the decreased boundary correction 


raa f ‚Vrgraduyds. (2) 


This in turn is due to the increasing size and decreasing ionization energy along 
the above series. 


The first real problem arises in connection with the cohesive energy of the 
earth alkalis which, as mentioned before, would vanish, were the s zone well 
separated from the following zones. Instead, the cohesion is invariably larger in 
the earth alkalis than in the preceding alkali. One can understand this in the 
following way®. 


The effective charge and hence the ionization energy is larger in the earth 
alkalis than in the alkalis. This increases the boundary correction (2). This, in 
itself, does not explain the cohesion because the Fermi energy should increase 
simultaneously. However, the Fermi energy increased in this way becomes much 
larger than the excitation energy of the next p electron (or d electron) even if 


Qualitative Theory of the Cohesion in Metals 449 


WIGNER: QUALITATIVE THEORY OF THE COHESION IN METALS 659: 


this is augmented by the difference of the boundary corrections of the p (or d) and 
s electrons. Because of the higher density of states in the p (and d) zones, as 
compared with the s zones, this puts an effective stop to the further increase of 
the Fermi energy and causes the boundary correction to prevail. 


In agreement with the above picture is the interpretation, given by Mott and 
Skinner, to Skinner’s experiments on the absorption of soft X rays. They find that 
the top of the s zone has s character in alkalis but character in earth alkalis. 
It is also interesting to follow, from this point of view, the ratio of the cohesive 
energies of alkalis and earth alkalis. This is greatest in the Be-Li.pair in which 
the ratio of p excitation energy to boundary correction is smallest. In the Mg-Na 
pair the difference between p and s levels is relatively larger and, hence, the ratio 
of cohesive energies is much smaller. The cohesive energy ratio is again large for 
the Ca-K pair either because the level is very close in this case or because the 
d level, with its relatively large boundary correction, is also quite close. The same 
is true for the Sr-Rb pair although, in this case, the d zone is more likely to be 
involved than the d zone. 


The weakness of the above description of the cohesion in the earth alkalis is 
that it presents no general reason for this cohesion exceeding that of the preceding 
alkali. On the other hand, it is well to remember that the hole correction (Table 
1) is positive in this and only this case. 


Let us go over now to the consideration of the p shell. Figure 4 already 
indicates that if the p shell follows an s shell, part of its states are filled already 
when the $ configuration is reached. The same is true, presumably, if the s? shell 
is followed by a d shell. Furthermore, Table 1 shows that the hole correction 
becomes strongly negative when the 2° configuration is reached. One will expect, 
therefore, that the third p electron will already decrease the cohesive energy —a 
phenomenon which the naive theory would expect to happen at the fourth p 
electron. In fact, in every case excepting that of C, maximum binding is reached 
with the #? configuration. As was mentioned before, no explanation is being offered 
for the exceptionally large cohesive energy of C. 


Considerations quite similar to the above ones apply to the d shell. Maximum 
cohesion is reached in both d shells at the d’! configuration. However, in contrast 
to the p shell, addition of further d electrons leaves the solid metallic. As a result, 
one would expect, on the basis of Table 1, a certain recovery of the cohesion just 
beyond the half shell. i.e. at d°. In fact, a secondary maximum appears in both 
d shells at this point which may be an indication herefor. | 


Because of the low d-p energy difference in the atom, the p state may play 
some role already while the d shell is being filled. If so, one sees little evidence 
therefor: the maxima and minima come at the points where one would expect them 
without the interference of the p shell. Only the non negligible, though small 
cohesive energy of the d° configurations (Zn and Cd) gives evidence of an inter- 
action between p and d states. | 


450 Part II. Solid State Physics 


660 METALS 


5. Summary. 


There are four points which emerge from the above analysis and which can 
be stated very simply, 
1. Configurations of the atom which have a low excitation energy increase the 
cohesion in the solid. | 


2. The various zones, s,d,d are rarely separate. Usually, they overlap, hang 


together at boundary points of the Brillouin zone, etc. 


3. As a result of this circumstance, the Fermi energy is relatively smaller and 
less important in transition metals and even in the earth alkalis than in alkalis. 


4. On the contrary, the hole corrections cannot be expected to cancel in 
general as closely as they do in alkalis and they are responsible at least for some 
of the fluctuations of the cohesive energy. 


Appendix (added December 1953) 


CALCULATION OF THE HOLE ENERGIES FOR TABLE 1. 

The calculation leading to Table 1 closely follows that presented by F. Seitz 
and the present writer in an earlier article”. In first approximation, the potential 
in each cell is assumed to be the same as if the atom, which is at the center of 
the cell, were the only atom present. The wave functions are supposed to satisfy, 
on the surface of the cell, the proper metallic boundary conditions. The energy 
is then calculated for the wave functions so obtained taking into account all inter- 
actions, such as the effect of the atoms outside of the cell, the interaction of all 
the electrons, etc. The term “hole correction” means the difference between the 
energy evaluated in this way and the energy obtained under the assumptions of 
the first approximation, i.e. atomic field with metallic boundary conditions. This 
latter energy includes the zero point energy of the the free electrons so that this 
quantity does not appear in Table 1. The enumeration of the terms which follow 
applies to the third line of the Table, leading to the corrections for n<3 p electrons. 
The other entries in the Table can be obtained in the same way. Further details 
and a more complete justification of the procedure can be found in the aforemen- 
tioned article”. 


The electrons move, in the free atom, in the field of an ion. The charge of 
this ion is neutralized in the metal by the free electrons. Putting it differently: 
removal of a single electron does not change the potential in the metal and this 
remains the same as in a neutral atom. The difference between the two potentials 
is approximated by that of a uniform charge distribution, giving an average po- 
tential of 1.2e?/7, and an average potential energy of 1.2 e?/7, per electron. The 
total energy is one half of the product of charge and average potential. This gives 
the term —0.6 ne?/r. 


The atomic energy includes, in the Hartree-Fock approximation, not only the 
interaction of the charges but also certain exchange integrals which cause the 
difference between the various. terms arising from the same configuration®. The 
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origin of these terms are the correlations which are described in footnote om 
page 652. Strictly speaking, one should compare the energy of the metal with that. 
of the lowest state ofthe atom. Instead, the average energy of all terms of highest 
multiplicity is considered. This does not differ too much from the energy of the 
lowest term (it is, actually, the same in the shell) and is much easier to calculate. 
One easily finds that it is equal, per electron pair, to the average value of 


Ts = [ae (2)* Zap, (1) hu (2) d1d2, (Al) 


where vw, and y, are two wave functions of the p shell with magnetic quantum 
numbers yw and v. The average is to be taken over all p,» combination: 


2 oe | | 
J= (IFI) Za: (A2) 
This J must be multiplied with the number of pairs of electrons, n(m—1)/2, and 
given the negative sign because it appears in the energy of the atom, rather than 
the metal. No term similar to (A2) appears in the calculation of the cohesion of 
alkali metals” because m=1 in that case. 


The above contributions, which decrease the binding energy, are compensated 
at least partially by the result of the correlations in the metal. We remain again 
within the Hartree-Fock picture, i.e. calculate only the exchange terms. When 
calculating these, we have to remember that p electrons give rise to three energy 
surfaces, i.e. three zones, in momentum space. These energy surfaces coincide at 
certain points and even lines but are distinct for a general wave vector. It appears 
convenient, therefore, to distinguish between exchange integrals involving wave 
functions of the same zone and exchange integrals involving different zones. The 
first one can be particularly easily calculated if one assumes that one has to deal 
with plane waves and becomes 


0.458 è y 


(A3) 


per electron if there are altogether n’ electrons per atom in the zone”. This n’ 
was 1 for alkalis because there was only one zone and one electron per atom; in 
the present case n'=n/3 because the n electrons are distributed to three zones. 
(A3) gives the contribution per electron, hence the factor m in the term 
0.458 nn’%e?/r, of Table 1. 


The exchange integral between electrons of different zones will be denoted by 
J’. If the wave functions are given by expressions of the form (1), J’ can be 
calculated along the lines given in reference” and is then approximately equal 
to J. It appears only between electrons with parallel spin and for these only if 
they are in different zones. Half of the electrons have their spin parallel to a 
given electron and 3 of these are not in the zone of the given electron. This 
would give a factor 19(3g)m” but since one counts every interaction twice in this 
way, the factor of J’ is /6 per electron or n?/6 altogether. No expression similar 
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to J’n*/6 applies in the case of alkalis because there is only one zone in this case. 
Hence, this term has no counterpart in earlier calculations”. 


The quantities C and C’ stand for the energy due to correlations which go 
beyond the Hartree-Fock picture and which will not be described in further detail. 
C refers to the atom, C’ for the metal; the latter is probably much greater than 
the former. 


The fourth and successive lines of the Table give the values of the third line 
for n=1,2,3 and of a similar expression for n=4,5,6 which applies when the p 
shell is more than half full. To simplify the final expression, J=J’ was set in the 
last six lines and C and C’ omitted in the last three. Even though the above 
consideration must be regarded as extremely crude and sketchy, it is believed that 
a large part of the errors committed would compensate. 


The Table shows clearly how the “hole correction ” increases up to the middle 
of the shell and then decreases again. At the middle of the shell, it contributes 
a substantial correction which decreases the cohesive energy calculated in the first 
approximation. On the other hand, the hole correction practically vanishes at the 
end of the shell. 
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DISCUSSION 


F. Seitz: What is the broken line ? 


Wigner: The broken line in Figure 2 represents the ionization energy. It is 
given in the Figure because it illustrates an interesting relation: the ionization 
energy, as function of the atomic member Z, is usually convex where the cohesive 
energy is concave, and vice versa. Such a behavior could be expected from the 
picture on which I was reporting. 


N. F. Mott: What is J which you just mentioned ? 


Wigner: J is the exchange energy, the Slater exchange integral between two 
p functions of the same atom. It is principally responsible for the energy diffe- 
rence between the triplet and singlet states of the two electron system and, more 
generally, for the energy difference between states of higher and lower multiplici- 
ties in many electron systems. If we denote the three p functions by v., Wo, W-s 
it is given analytically by 


2 
J= wma rn Dy. @dıa2. 
More generally and more accurately, I mean by j the average value 
___2 e 
J =a pn X [ODE Ov @aia, 


where u and v run from —/ to / and denote the magnetic quantum number, i.e. 
the projection of the orbital angular momentum into a fixed direction. 


J. H. Van Vleck: Whats the significance of J’? 


Wigner: J’ is the same quantity in the metal that was denoted by J in the 
atom. Its value may be different from J because it is calculated with metallic 
rather than atomic wave functions. However, the difference may not be too great 
and the last part of the Table assumes that J and J’ are equal. 


Van Vleck: What are C and C’? 


Wigner: These are the correlation energies in the atom and in the metal, 
i.e. they stand for an estimate of that part of the energy which is neglected in 
the Hartree-Fock picture. It appears that C’ is considerably greater than C and 
the last part of the Table omits C. 
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